(19) 



(12) 



Europaisches Patentamt 
European Patent Office 
Off Ice europ^en des brevets (11) EP 0 643 778 B1 

EUROPEAN PATENT SPECIFICATION 



(45) Date of publication and mention 
of the grant of the patent: 
31.05.2000 Bulletin 2000/22 

(21) Application number: 93908732.6 

(22) Date of filing: 30.03.1993 



(51) IntCI. 7 : C12Q1/70, C07K2/00, 
Q01N 33/53 

(86) International application number: 
PCT/US93/03127 

(87) International publication number: 

WO 93/20242 (14.10.1993 Gazette 1993/25) 



(54) ENCODED COMBINATORIAL CHEMICAL LIBRARIES 

KODIERTE KOMBINATORISCHE CHEMISCHE BIBLIOTHEKEN 
BIBLIOTHEQUES CHIMIQUES COMB I N ATOI RES CODEES 



(84) Designated Contracting States: 

AT BE CH DE DK ES FR GB GR IE IT U LU MC NL 
PTSE 

(30) Priority: 30.03.1 992 US 860445 

(43) Date of publication of application: 
22.03.1995 Bulletin 1995/12 

(73) Proprietor: 

THE SCRIPPS RESEARCH INSTITUTE 
LaJofla, CA 92037 (US) 

(72) Inventors: 

• LERNER, Richard 

La Jofla, CA 92037 (US) 

• JANDA, Khn 

San Diego, CA 92107 (US) 

• BRENNER Sydney 

Edwards Passage, Cambridge CB2 3PJ (GB) 

• NIELSEN, John 

San Diego, CA 92122 (US) 

(74) Representative: 
Hsher, Adrian John 
CARPMAELS & RANSFDRD 
43 Bloomsbwy Square 
London WC1 A 2RA (GB) 



(56) 



References cited: 
EP-A- 0 323 152 
WO-A-92/00091 
WO-A-93/06121 
US-A-4923 901 



WOA-90/D0626 
WO-A-92/09300 
WO-A-93/09668 
US-A-4 965188 



SCIENCE, voL251 , 1 5 February 1 991 pages 767 - 
778 SwPJLFODOER ET AL. 'Light-Directed, 
Spatially Adressable Parallel Chemical 
Synthesis/ 

Nature, Volume 354, issued 07 November 1991, 

LAM et aL, "A New Type off Synthetic Peptide 

Library tor identifying Ugand- Binding", pages 

82-84y see entire documenL 

PJLA&, Volume 81, issued 1984, GEYSEN et aL, 

"Use of Peptide Synthesis to Probe Viral 

Antigens for Epitopes to a Resolution of a Single 

Amino Add", pages 399*4002, see Abstract and 

Materials and Methods. (Synthesis of Peptides). 

Tetrahedron Letters, Volume 26, issued 1985, 

HAYES et at, Tfgh-Yteld Synthesis of 

OQgoribonucieotftdeste 

Protection of r-Hydraxyts", pages 2407-2410, 

see page 2407, Summary. 

Nature, VWume 354> issued 07 November 1991, 

HOUGHTENetaL, "Generation and Use of 

Syntiettc Peptide Combinatorial Libraries for 

Basic Research and Drug Discovery, pages 84- 

86, see page 84. 



Note: Within nine months from the publication of the mention of the grant of the European patent, any person may give 
notice to the European Patent Office of oppos i tion to the European patent granted. Notice of opposition shall be filed in 
a written reasoned sta t em e n t It shall not be deemed to have been fied urtH the opposition fee has been paid. (Art 
99(1) European Patent Convention). 



Prtmed by Xerox (UK) Business Services 

2.16.7 (HRSV3.6 



C o43 778 B1 



Description 
Technical Field 

5 [0001 ] The present invention relates to encoded chemical libraries that contain repertoires of oligopeptide/ofigonu- 
cleotide conjugates defining a diversity of biological structures, and methods for using the Itoraries. 

Background 

10 [0002] There is an increasing need to find new molecules which can effectively modulate a wide range of biological 
processes, for appfications in medicine and agriculture. A standard way for searching for novel bioactrve chemicals is to 
screen collections of natural materials, such as fermentation broths or plant extracts, or libraries of synthesized mole- 
cules using assays which can range in complexity from simple binding reactions to elaborate physiological prepara- 
tions. The screens often only provide leads which then require further improvement either by empirical methods or by 

is chemical design. The process it time-consuming and costly but it is unJikery to be totally replaced by rational methods 
even when they are based on detailed knowledge of the chemical structure of the target molecules. Thus, what we 
might call "irrational drug design" - the process of selecting the right molecules from large ensembles or repertoires - 
requires continual improvement both in the generation of repertoires and in the methods of selection. 
[0003] Recently there have been several developments in using peptides or nucleotides to provide libraries of com- 

20 pounds for lead discovery. The methods were originally developed to speed up the determination of epitopes recog- 
nized by monoclonal antibodies. For example, the standard serial process of stepwise search of synthetic peptides now 
encompasses a variety of highly sophisticated methods in which large arrays of peptides are synthesized in parallel and 
screened with acceptor molecules labelled with fluorescent or other reporter groups. The sequence of any effective 
peptide can be decoded from its address in the array. See for example Geysen et al., Pry? Ngtt A^ yi.sci.USA. 81 3998- 

25 4002 (1984); Maeji et al., J. Immunol. Met. 14653-90 (1992); and Fodor et al., Science. 251: 767-775 (1991). 

[0004] In another approach, Lam et al.. Nature. 354:82-84 (1 991 ) describes combinatorial Ifcraries of peptides that 
are synthesized on resin beads such that each resin bead contains about 20 pmoles of the same peptide. The beads 
are screened with labelled acceptor molecules and those with bound acceptor are searched for by visual inspection, 
physically removed, and the peptide identified by direct sequence analysis. In principle, this method could be used with 

30 other chemical entities but it requires sensitive methods for sequence determination. 

[0005] A different method of solving the problem of identification in a combinatorial peptide Iforary is used by 
Houghten et aL, Nature. 354:84-86 (1991). For hexapeptides of the 20 natural amino acids, 400 separate fibraries are 
synthesized, each with the first two amino acids fixed and the remaining four positions occupied by all possfole combi- 
nations. An assay, based on competition for binding or other activity, is then used to find the Iforary with an active pep- 

35 tide Then twenty new libraries are synthesized and assayed to determine the effective amino acid in the third position, 
and the process is reiterated in this fashion unffl the actrahexapeptide is denned 
in searching a dictionary; the peptide is decated by constnxtion using a sot 
search logarrthrric. 

[0006] A very powerful biological method has recently been described in which the Iforary of peptides is presented 
40 on the surface of a bacteriophage such that each phage has an individual peptide and contains the DNA sequence 
specifying it The Iforary is made by synthesizing a repertoire of random digonudeotides to generate all oyrixnations, 
tol towed by their insertion into a phage vector. Each of the sequences is cloned in one phage and the relevant peptide 
can be selected by finding those that bind to the particular target The phages recovered in this way can be ampfif ied 
and the selection repeated. The sequence of the peptide is decoded by sequencing the DNA. See for example Cwirta 
45 et aL. Proc.NatLAcad.Sci USA, 87:6378-6382 (1990); Scott etaL, Science. 249386-390 (1990); and Devin etaLSri- 
fiOCfi. 249:404-406 (1990). 

[0007] Another "genetic* method has been descrfoed where the Rxaries are the synthetic ofigonudeotides them- 
selves wherein active o6gonudeotkJe molecules are selected by bcndmg to an acceptor and are then amplified by the 
polymerase chain reaction (PCR). PCR allows serial enrichment and the structure of the active molecules is then 
so decoded by DNA sequencing on clones generated from the PCR products. The repertoire is limited to nucleotides and 
the natural pynrridirte and purine bases or those nxxSficafions that preserve specific Watson-Crick pairing and can be 
copied by polymerases. 

[0006] The main advantages of the genetic methods reside in the capacity for cloning and amrjtihcation of DNA 
sequences, which allows enrichment by serial selection and provides a facie method for decaf ng the structure of 
55 active molecules However, the genetic repertoires are restricted to nucleotides and peptides conyo se d of natural 
amino acids and a more extensive chemical repertoire is required to populate the entire universe of birring sites. fri 
contrast dierrical methods can provide limitless repertoires but they lack the capacity for serial enrichment and there 
are difficulties in discovering the structures of selected active molecules. 
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Brief Summary of the Invention 

[0009] The present invention provides a way of combining the virtues of both of the chemical and genetic methods 
summarized above through the construction of encoded combinatorial chemical libraries, in which each of a collection 

5 of polypeptides is labelled by an appended "genetic" tag, itself constructed by chemical synthesis, to provide a "retro- 
genetic" way of specifying each polypeptide. As used herein, the term "polypeptide" includes glycopolypeptides. * 
[0010] In outline, two alternating parallel combinatorial syntheses are performed so that the genetic tag is chemi- 
cally linked to the polypeptide being synthesized; in each case, the addition of one amino acid residue to the structure 
is followed by the addition of an oligonucleotide sequence, which is defined to "code" for that amino acid residue, i.e., 

w to function as an identifier for the structure of the amino acid residue. The Iforary is built up by the repetition of this proc- 
ess after pooling and division. 

[001 1 ] Active molecules are selected from the library so produced by binding to a preselected biological molecule 
of interest or by identifying a species in the library having a desired activity, inducting binding, activation, chemical catal- 
ysis and the like. Thereafter, the identity of the active molecule is determined by reading the genetic tag, i.e., the tden- 
is tifier oligonucleotide sequence. In one embodiment ampfified copies of their retrogenetic lags can be obtained by the 
polymerase chain reaction. 

[0012] The strands of the amplified copies with the appropriate polarity can then be used to enrich for a subset of 
the library by hybridization with the matching tags and the process can then be repeated on this subset Thus serial 
enrichment is achieved by a process of purification exploiting linkage to a nucleotide sequence which can be amplified. 
20 Finally, the structure of the polypeptide is decoded by cloning and sequencing the products of the PCR reaction. 
[0013] The present invention therefore provides a novel method for identifying a polypeptide having a preselected 
binding or catalysis activity through the use of a library of bifunctional molecules that provides a rich source of chemical 
diversity. The library is used to identify chemical structures (structural motifs) that interact with preselected biological 
molecules. 

25 [0014] Thus, in one embodiment the invention contemplates a bifunctional molecule according to claim 1 . 

[001 5] In another embodiment the invention contemplates a Iforary comprising a plurality of species of bifunctional 
molecules, thereby forming a repertoire of chemical diversity. 

[001 6] Another embodiment contemplates a method for identifying a polypeptide that participates in a preselected 
chemical or biochemical interaction with a biologically active molecule, such as binding or catalysis, where the chemical 
30 structure is present in the library of bifunctional molecules accorcfing to this invention. The method comprises the steps 
of: 

a) admixing in solution the Iforary of bifunctional molecules with the biologically active molecule under binding con- 
ditions for a time period sufficient to form a binding reaction complex; 
35 b) isolating the complex formed in step (a) ; and 

c) determining the nucleotide sequence of the polymer identifier oligonucleotide in the isolated complex and 
thereby identifying the polypeptide that participated in the preselected binding interaction. 

[0017] The invention also contemplates a method as claimed in daim 14 for preparing a library according to this 
40 invention. 

[0018) In a related embodiment the invention describes a bifunctional solid support for synthesizing an oligopep- 
tide/ oligonucleotide conjugate comprising: (1) a solid support the solid support being of a type which is cfispersftxe in 
aqueous solution, (2) a first linkage unit cxxjpled to the solid support (3) a seccrt 
age unit and (4) a bifunctional unit oxxxed to trie second lintagei^ 

45 group employable tar oligopeptide synthesis and a second leaving group employable for oigonudeotide synthesis, 
wherein the first leaving group is N-FMOC or its functional equivalent the second leaving group is O-OMT or Hs func- 
tional equivalent wherein the second linkage unit is coupled to the first Enkage unit by means of a bond deavaUe by 
exposure to concentrated aqueous ammonia and wherein the solid support the first lintage unit the second linkage 
uiiit the deavabte bond, aiid the bcfonti 

so tiaily<*enicaflyureeact^ groups and 

cond i tions employed by ottgortudeotkje synthetic 
[0019] FixtherOTtenxxatedisaneJe^ 

soW support being of a type whk*t is dfepera^ (2) a first linkage unrtcxxpled to the solid support 

(3) a second linkage unit coupled to the first Enkage unit (4) a bifunctional unit coupled to the second Enkage unit (5) 
ss an oligopeptide attached to the bifunctional unrt, and (6) an oligonucleotide attached to the bifunctional unit 
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Brief Description of the Drawings 

[0020] In the drawings, forming a portion of this disclosure: 

5 Figure 1 illustrates a scheme for the restriction endonuclease cleavage of a PCR amplification product derived from 
a brfunctional molecule of this invention (Step 1). and the subsequent addition of biotin to the cleaved PCR product 
(Step 2). 

Figure 2 illustrates the first three steps in the process of producing a library of bifuncttonaJ molecules according to 
the method described in Example 9. 
10 Figure 3 illustrates the structure of a preferred bifurtctional linker-support molecule based on a controlled pore glass 
(CPG) support designated "bf-CPG" that is useful in practicing the present invention and is described in Example 

3a 

Detailed Description of the Invention 

15 

A. Encoded Combinatorial Chemical Libraries 

[0021 ] An encoded combinatorial chemical library is a composition comprising a plurality of species of bifunctional 
molecules that each define a different chemical structure and that each contain a unique identifier oligonucleotide 
20 whose nucleotide sequence defines the corresponding chemical structure. 

1. Bifunctional Molecules 

[0022] A bifunctional molecule is the basic unit in a library of this invention, and combines the elements of a 
25 polypeptide comprised of a series of amino acid residues, and a code for identifying the structure of the polypeptide. 
[0023] Thus, a bifunctional molecule can be represented by the formula A-B-C, where A is a polypeptide, B is a 
linker molecule operatively linked to A and C. and C is an identifier oligonucleotide comprising a sequence of nucle- 
otides that identifies the structure of polypeptide moiety A. 

30 a. Chemical Porvmers 

[0024] A chemical moiety in a brfunctional molecule of this invention is represented by A in the above formula A-B- 
C and is a polypeptide comprising a linear series of amino acid residues represented by the formula Xp, wherein X is a 
single amino acid residue in polypeptide A and n is a position identifier for X in polypeptide A. n has the value of 1+i 
35 where i is an integer from 0 to 10, such that when n is 1. X is located most proximal to the linker (B). 

[0025] Although the length of the polypeptide can vary, defined by n, practical Ifcrary size limitations arise if there 
is a large alphabet size as cfiscussed further herein, n is therefore an integer from 4 to 50. 

[0026] The chemical unit X can be selected to form a region of a natural protein or can be a non-natural polypep- 
tide, can be con^xised of nabral D-arri^ 
40 and non-natural amino acids. The non-natural corrtoinations provide for the identification of useful and unique structural 
motifs involved in biological interactions. 

[0027] Nlon-natural amino adds include modtfied amino acids and L-amirto acids, stereoisomer of D-amino acids, 
and any other compounds that can fonrn an arnkte cy pseufo-aniide linhage. 

[0028] The arrino acid residues descrtoed herein are preferred to be in the 1/ isomeric form. "H" refers to the free 
45 amino group r^esert attheainroterrririuso 

boocy terminus & a porypeptida In keeping with standard polypeptide rornenciatura J. Bid. Chem.. 243:3552-59 (1 969) 
and adopted at 37 C.F.R. §1.822(bX2)). abbreviations for amino acid residues are shown on the following Table of Cor- 
respondence: 

so 



TABLE OF CORRESPONDENCE 


SYMBOL 


AMINO ACID 


1 -Letter 


3-Letter 




Y 


Tyr 


tyrosine 
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(continued) 



TABLE OF CORRESPONDENCE 


SYMBOL 


AMINO ACID 


1 -Letter 


3-Letter 




Q 


Gly 


glycine 


F 


Phe 


phenylalanine 


M 


Met 


methionine 


A 


Ala 


alanine 


S 


Ser 


serine 


I 


lie 


isoleucine 


L 


Leu 


leucine 


T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 


Q 


Gin 


glutamine 


E 


Glu 


glutamic acid 


W 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


asparticacid 


N 


Asn 


asparagine 


C 


Cys 


cysteine 



35 [0029] The phrase "amino acid residue" is broadly defined to indude the amino acids listed in the Table of Corre- 
spondence and modified and unusual amino acids, such as those listed in 37 C.F.R. §1 .822(b)(4), and incorporated 
herein by reference. 

[0030] The polymer defined by chemical moiety A can therefore contain any polymer backbone modifications that 
provide increased chemical diversity. In building of a polypeptide system as exemplary, a variety of modifications are 
40 contenptated, inducing the Mowing backbone structures: -NHN(R)CO-, -NHB(R)CO-, -NHC(RFr)CO-, - 
NHC(=CHR)CO-, -NHCeHtCO-, -NHCHzCHRCCh -NHCHRChfeCO-. and lactam structures. 
[0031] In adcfiticn, amide bond modifications are contemplated including -COCH2-. -COS-, -CONR, -COO-, - 
CSNH-, -CHgNH-, -CHzCHg-, -CH2S-, -CH2SO-, -CH2SO2-, -CHJCKyS-. -CH-CH-. -NHCO-. -NHCONH-, -CONHO-, - 
C^O-yCIV, -POg NH-, -PO2 CH2-. -POg CHzN*-, and -SOjNHV 

45 

b. Polymer Identifier QBgonudeotide 

[0032] An identifier digonudeotide in a Wunctional molecule of this invention is represented by C in the above for- 
mula A-B-C and is an oligonudeotide having a sequence represented by the formula Z^ t wherein Z is a sequence of 2 
so to 10 nudeotides within oligonudeotide C that identifies the chemical unit X at position n. 
[0033] For example; a Afunctional molecule can be represented by the formula: 

X 4 X 3 X 2 X r B-Z 1 2 2 Z 3 Z4. 

55 in this example, the sequence of oligonudeotides Z 1t Z^ Z 3 and Z* identifies the structure of amino add residues X 1 , 
Xg. X3 and X4. respectively. Thus, there is a correspondence in the identifier sequence between an amino acid residue 
X at position n and the unit identif ier oligonudeotide Z at position a 

[0034] The length of a unit identifier digonudeotide can vary depending on the complexity of the Itorary. the number 
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of different amino acid residues to be uniquely identified, and other considerations relating to requirements for unique- 
ness of oligonucleotides such as hybridization and polymerase chain reaction fidelity. Hie length is from 2 to 10 nucle- 
otides. 

[0035] Insofar as adenosine (A), guanosine (G), thymidine (T) and cytidine (C) represent the typical choices of 
s deoxynucleotides for inclusion in a unit identifier oligonucleotide, A, G, T and C form a representative "alphabet" used 
to "spell" out a unit identifier oligonucleotide's sequence. Other nucleotides or nucleotide analogs can be utilized in 
addition to or in place of the above four nucleotides, so long as they have the ability to form Watson-Crick pairs and be 
replicated by DNA polymerases in a PCR reaction. However, the nucleotides A, G, T and C are preferred. 
[0036] For the design of the code in the identifier oligonucleotide, it is essential to chose a coding representation 
10 such that no significant part of the oligonucleotide sequence can occur in another unrelated combination by chance or 
otherwise during the manipulations of a bifunctional molecule in the Itorary. 

[0037] For example, consider a Itorary where 2 is a trinucleotide whose sequence defines a unique amino acid res- 
idue X. Because the methods of this invention provide for all combinations and permutations of an alphabet of chemical 
units, it is possible for two different unit identifier oligonucleotide sequences to have closely related sequences that differ 
15 by only a frame shift and therefore are not easily distinguishable by hybridization or sequencing unless the frame is 
dear. 

[0038] Other sources of misreading of a unit identifier oBgortudeotide can arise. For example, mismatch in DNA 
hybridization, transcription errors during a primer extension reaction to amplify or sequence the identifier oligonucle- 
otide, and the like errors can occur during a manipulation of a bifunctional molecule. 

20 [0039] The invention contemplates a variety of means to reduce the possbDity of error in reading the identifier oli- 
gonucleotide, such as to use longer nucleotide lengths for a unit identifier nucleotide sequence as to reduce the simi- 
larity between unit identifier nucleotide sequences. Typical lengths depend on the size of the alphabet of chemical units. 
[0040] A representative system useful for eliminating read errors due to frame shift or mutation is a code developed 
as a theoretical alternative to the genetic code and is known as the commaiess genetic coda Crick et aL, Proc. Natl. 

25 Acad. Sci. USA. 43:416-421 (1957). 

[0041] A convenient unit identifier nucleotide sequence is the well known genetic code using triplet codons or the 
use of a commaiess code as described by Crick et al. v supra. The invention need not be limited by the translation 
afforded between the triplet codon of the genetic code and the natural amino acids; other systems of correspondence 
can be assigned. 

30 [0042] Atypical and exemplary unit identifier nucleotide sequence is based on the commaiess code described in 
the Examples, and having a length of six nucleotides (hexanucieotide) per chemical unit to provide a length that assures 
complementary hybridization. 

[0043] Preferably, an identifier oligonucleotide has at least 15 nucleotides in the tag (coding) region for effective 
hybridization. In addition, considerations of the complexity of the ftorary, the size of the alphabet of chemical units, and 
35 the length of the polymer length of the chemical moiety afl contrfcute to length of the identifier oBgonudeotide as dis- 
cussed in more detail herein. 

[0044] In a preferred embedment an identifier oligonucleotide C has a nucleotide sequence according to the for- 
mula P1-(ZJ a -P2, where P1 and P2 are nucleotide sequences that provide polymerase chain reaction (PCR) primer 
binding sites adapted to amplify the polymer identifier oligonucleotide. The requirements for PCR primer binding sites 
40 are generally well known in the art but are designed to allow a PC R amplification product (a PCR-amplif ied duplex DNA 
fragment) to be formed that contains the polymer identifier oligonucleotide sequences. 

[0045] The presence of the two PCR primer bincfing sites; PI and P2, flanking the identifier oligonucleotide 
sequence (ZJa provkfes a means to produce a PCR-a^ DNA fragment derived from the bifunctionai mol- 

ecule using PCR. This design is useful to alow the amplification of the tag sequence present on a particular bifunctional 
4s molecule for cloning and sequencing purposes in the process of reading the identfier code to determine the structure 
of the polypeptide in the bifunctional molecule. 

[0046] More preferred is a bifunctional molecule where one or both of the nucleotide sequences P1 and P2 are 
designed to contain a means for removing the PCR primer binding sites from the identifier oligonucleotide sequences. 
Removal of the flanking P1 and P2 sequences is desirable so that their sequences do not contribute to a subsequent 
so hybri diza tion reaction. Preferred means for removing the PCR primer bincfing sites from a PC R arrptification product is 
in the form of a restriction endor^ 

[0047] Restriction endonucteases are wefl taown in the art and are enzymes that recognize specific lengths of 
duplex DNA and cleave the DNA in a sequence-specffic manner. 

[0048] Preferably; the restric ti on endonudease sites should be positioned proximal to (Z„) a relative to the PCR 
55 primer binding sites to nriaximize the amount of P1 and P2 that is removed upon treating a bifunctional molecule to the 
specific restriction endonudease. More preferably. P1 and P2 each are adapted to form a restriction endonudease site 
in the resulting PCR-amptified duplex DNA, and the two restriction sites, when deaved by the restriction endonudeas e , 
form non-overlapping cohesive termini to facifitate subsequent manipulations. 
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[0049] Particularly preferred are restriction sites that when cleaved provide overhanging termini adapted for termini- 
specific modifications such as incorporation of a biotinylated nucleotide (e.g., biotinyl deoxy-UTP) to facilitate subse- 
quent manipulations. 

[0050] The above described preferred embodiments in an identifier oligonucleotide are summarized in a specific 

5 embodiment shown in Figure 1 . 

[0051] In Figure 1, a PCR-amplified duplex DNA is shewn that is derived from an identifier oligonucleotide 
described in the Examples. TTie (ZJ sequence is illustrated in the brackets as the coding sequence and its complemen- 
tary strand of the duplex is indicated in the brackets as the artticoding strand. The P1 and P2 sequences are shown in 
detail with a Sty I restriction endonuclease site defined by the P1 sequence located 5' to the bracket and an Apy I 

w restriction endonudease site defined by the P2 sequence located 3' to the bracket. 

[0052] Stall illustrates the cleavage of the PCR-amplified duplex DNA by the enzymes Sty I and Apa I to form a 
modified identifier sequence with cohesive termini. Step 2 illustrates the specific btotinytation of the artticoding strand 
at the Sty I site, whereby the incorporation of biotinylated UTP is indicated by a B. 

[0053] The presence of non-overlapping cohesive termini after Step 1 in Rgure 1 allows the specific and directional 
is cloning of the restriction-digested PCR-amplified fragment into an appropriate vector, such as a sequencing vector. In 
addition, the Sty I was designed into P1 because the resulting overhang is a substrate for a f illing-in reaction with dCTP 
and biotinyl-dUTP (BTP) using DNA polymerase Klenow fragment. The other restriction site, Apa I, was selected to not 
provide substrate for the above biotinylatioa so that onfy the artticoding strand can be biotinylated. 
[0054] Once biotinylated, the duplex fragment can be bound to immobilized avidin and the duplex can be denatured 
20 to release the coding sequence containing the identifier nucleotide sequence, thereby providing purified anticoding 
strand that is useful as a hybridization reagent for selection of related coding strands as described further herein. 

c. Unter Molecvles 

25 [0055] A linker molecule in a bifunctional molecule of this invention is represented by B in the above formula A-B-C 
and can be any molecule that performs the function of operatively linking the polypeptide to the identifier oligonucle- 
otide. 

[0056] Preferably, a linker molecule has a means for attaching to a solid support thereby facilitating synthesis of 
the bifunctional molecule in the solid phase. In addition, attachment to a solid support presides certain features in prac- 
30 tiring the screening methods with a Ifcrary of bifunctional molecules as described herein. Particularly preferred are 
linker molecules in which the means for attaching to a solid support is reversible, namely, that the linker can be sepa- 
rated from the solid support 

[0057] A linker molecule can vary in structure and length, and provide at least two features: (1 ) operative linkage to 
chemical moiety A, and (2) operative linkage to identifier oligonucleotide C. As the nature of chemical linkages is 

35 diverse, any of a variety of chemistries may be utilized to effect the indicated operative linkages to A and to C, as the 
nature of the linkage is not considered an essential feature of this invention. The size of the linker in terms of the length 
between A and C can vary widely, but for the purposes of the invention, need not exceed a length suff icient to provide 
the linkage functions indicated. Thus, a chain length of from at least one to about 20 atoms is preferred 
[0058] Preferred linker molecules are described in Example 3 herein that contains the added, preferred, element of 

40 a reversible means for attachment to a solid support That is, the brfeinctionaJ molecule is removable from the solid sup- 
port after synthesis. In another embodiment the linker has the ability to be cleaved slowly over time to release the 
bifunctional molecule; or portions thereot 

[0059] A variety of chemo- or enzyme-cotoctive deavage fiinctionaities could be incorporated into a linker-support 
molecule useful in the present invention. For example, a 44iydrcxymethyl phenoxyacetic acid moiety provides an aod- 
45 deavabte linker. A 2-Rtert-butytf phenyt^^ acid moiety provides a fluoride-deavable moiety. The 

phosphate of a 2-hydraxymethyt benzoic acid moiety provides a site cteavaUe by the combination of alkaline phos- 
phatase treatment followed by mild altalro treatment Thus, the ^ 
those recited above are also considered a part of the invention 

[0060] SoSd supports for chemical syndesis are generally well known Particularly preferred are the synthetic res- 
so ins and controlled pore glass (CPG) supports used in oligonucleotide and in polypeptide synthesis that are available 
from a variety of commerces (Herndoa VA), Bachem Biosciences (Philadelphia, PA), 

Sigma Chemical Ca (St Louis, MO). CPG fcna, (Fairfield, NJ)and Applied Biosystems (Foster City. C A). Most preferred 
are teflon and CPG supports such as are de6crt>ed in Example 2. 

[0061 ] In a related embo di men t the invention describes a preferrred bifunctional solid support particularly suited 
55 for producing a bifunctional molecule of this invention having a polypeptide as the chemical moiety (i.a, an oligopep- 
tkJefoligonudeotide conjugate). 

[0062] A prefened bifunctional solid support for synthesizing oligopeptide/oligonucleotide conjugates comprises a 
solid support a first Inkage unit a second linkage unit, and a bifunctional unit A "bifunctional unif in this context is not 
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to be confused with a Afunctional molecule of this invention, and refers instead to that chemical moiety present on a 
bifunctional solid support which provides the two (bi-) reactive functionalities, one for coupling the peptide and another 
for coupling the oligonucleotide. An exemplary bifunctional unit is shown in Figure 3 as the serine-branch monomer fol- 
lowing the aminohexanol linker. 

5 [0063] The solid support used herein is of a type which is cfispersfcte in aqueous solution, such as were described 
above. A preferred solid support is a controlled pore glass of a type employable tor peptide and oligonucleotide synthe- 
sis, e.g. aminopropyl-CPG supplied by Sigma The solid support is tightly coupled to the first Gnkage unit A preferred 
first linkage unit includes a sarcosine linker coupled to the amincpropyl-CPG and a succinyt linker coupled to the sar- 
cosine linker by means of an amide bond. The second linkage unit is, in turn, coupled to the first linkage unit. A preferred 

w second linkage unit is an arrinohexand group. A preferred bond for coupling the first and second linkage units is an 
alkyl ester. Alkyi esters are readily hydroiysed or cleaved upon exposure to concentrated aqueous ammonia. 
[0084] The bfeinctional unit has a first leaving group employable for oligopeptide synthesis and a second leaving 
group employable for oligonucleotide synthesis. A preferred bifunctional unit is an L-serine residua The L-serine resi- 
due includes an amino end, a cartaxyl end, and a hydroxy! end. The serine residue is coupled at its carboxyf end to the 

is second linkage unit In a preferred embodiment the cartaxyl end of the serine residue is couple by means of an amide 
bond to an aminohexanol linker. The serine residue is also coupled at its amino end to the first leaving group and at its 
hydroxy! end to the second leaving group. A preferred first leaving group is N-FMOC (N^9-fluorenylmethoxycarbonyi] 
or its functional equivalent A preferred second leaving group is O-DMT (O-dtrnetrraxytrityl) or its functional equivalent. 
[0065] The solid support, the first Gnkage unit, the second Gnkage unit the cleavable bond therebetween, and the 

20 bifunctional unit exclusive of said first and second leaving groups, are each substantially chemically unreactive under 
conditions employed during conventional oligopeptide synthesis protocols using FMOC leaving groups [see: Bodan- 
szky et al.. in The practice of Peptide Synthesis, Springer-Verlag, (1984); and Bodanszky et al., in Principles of Peptide 
Synthesis, Springer-Verlag, (1984)] and under conditions employed during conventional oligonucleotide synthesis pro- 
tocols using O-DMT leaving groups and phosphorarridfte donors. 

25 [0066] In an alternative embocfiment the bifunctional solid support also includes a third linkage unit The third link- 
age unit is interposed between and coupled to the bifunctional unit and the first leaving group. In a preferred mode of 
this embodiment the third linkage unit is photosensitive, i.a it is cleavable by exposure to ultra-violet light The third link- 
age unit may include a 3-nHro-4-brornornethyl berc amino end of 
the bifunctional serine residue and coupled by means of an ester bond to an FMOC biocked amino acid. 

30 

2. Libraries 

[0067] A Itorary of this invention is a repertoire of chemical diversity comprising a plurality of species of bifunctional 
molecules according to the present invention. The plurality of species in a Itorary defines a famfly of chemical diversity 

35 whose species each have a different polypeptide moiety. 

[0068] The number of different species in a library represents the complexity of a library and is defined by the length 
of the polypeptide, and by the size of the amino acid alphabet that can be used to build the polypeptide. The number of 
different species referred to by the phrase "plurality of species" in a library can be defined by the formula V*. i.a, V to 
power of a (exponent a). V represents the alphabet siza i.e.. the number of different amino acids X available for use in 

40 the polypeptide. "a" isanexponerrtto Varxirepreserte 
A, La, the length of polypeptide A. 

[0069] toexarrptatorabifurxtfonaJ acids, and 

where the arriino adds utifized can be an^ 

(a) is 6, and the Itorary size is 20 6 or 64 million. This exemplary Itorary provides a repertoire of chemical diversity com- 
45 prising 64 million different hexarneric polypeptides operativeiy finked to corresponding unique identifier oligonucle- 
otides. 

[0070] Because the conptexhy of fre itorary 

cute relative the other species in the Itorary, there are theoretical Irrots to the maximum useful complexity in a Itorary. 
Another I irritation is the Itorary size dehned by tr»rurt>er erf actual st 
so Therefore it is useful to consider ftow 

sitivity for detecting the presence of a polymer identifier oligonucleotide after a screening procedure according to this 

inventior^ Detection sensrlrv^B 

to be assayed and a bifunctional moiecula 

[0071] If, for exampla the bcnrjng threshold is 10^M (rriaornolar), then there must be at least one nanomoie of 
55 each species in a Itorary of 1 milKer (ml) volume At this threshold, a Itorary having a complexity of 10 4 could contain 
1 0 rricromoles of each species. Because of the reciprocal relatirjnship between 
old, nriorea>nrptaltorariesareposeto^ 

[0072] The relative amounts of the individual brfunctfonal molecule species within the library can vary from about 
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0.2 equivalents to about 10 equivalents, where an equivalent represents the average amount of a species within the 
library. Preferably each species is present in the library in approximately equimolar amounts. 
[0073] In a preferred embodiment, a library contains the complete repertoire of chemical diversity possible based 
on the mathematical combinations for a given Itorary where there is a fixed alphabet and a preselected number of chem- 

5 ical units in afl species of the library. Thus a complete repertoire is one that provides a source of all the possible chem- 
ical diversity that can be found in a library of this invention having a fixed alphabet and chemical length. 
[0074] It is particularly preferred that a library be comprised of brfunctional molecules where each species of Afunc- 
tional molecule contains the same nucleotide sequence tor either the Pt or P2 PCR primer binding sites. A Itorary with 
this design is particularly preferred because, when practicing the methods of this invention, a single PCR primer pair 

10 can be used to amplify any species of identifier oligonucleotide (coding sequence) present in the library. 

[0075] In a related embodiment the invention contemplates an element of the Itorary, namely an oligopeptide/oli- 
gonucleotide conjugate as described earlier. 

[0076] Thus, an element of a fibrary of oligopeptide/oligonucleotide conjugates in one embodiment comprises a 
solid support a first linkage unit, a second linkage unit, a Wunctional unit an oligopeptide, and an oligonucleotide. The 
75 solid support is of a type which is dispersibte in aqueous solution. The first linkage unit is coupled to the solid support 
The second linkage unit is coupled to the first linkage unit The brfunctional unit is coupled to the second linkage unit 
The oligopeptide and the oligonucleotide are attached to the brfunctional unit 

[0077] In an alternative embodiment the library element includes a cleavable bond for coupling the first linkage unit 
to the second linkage unit. The cleavable bond is of a type which is cleavable by exposure to concentrated aqueous 
zo ammonia 

[0078] In another alternative embodiment, the library element includes a cleavable bond for coupling the Afunc- 
tional unit to the oligopeptide. The cleavable bond may be of a type which is cleavable by exposure to ultraviolet fight 
e.g. a 3-nRro^4rorrttmethyl benzoate group coupled by means of an amide bond to the amino end of the brfunctional 
serine residue and coupled by means of an ester bond to an FMOC blocked amino acid. 
25 [0079] The library element may also be made without the solid support and the first and/or second Bnkage units. In 
this instance, the library element comprises a brfunctional unit an oGgopeptide attached to said bifuncrjonal unit and 
an oligonucleotide attached to sad brfunctional unit. An alternative to this embodiment includes a cleavable bond for 
coupling the brfunctional unit to the oligopeptide. The cleavable bond may be of a type which is photosensitive, i.e. 
cleavable by exposure to ultraviolet light, as indicated above. 

30 

B. Methods for Producing a Library 

[0080] The present method for producing a plurality of brfunctional molecules to form a Itorary of this invention 
solves a variety of problems regarding efficient synthesis of large numbers of different species. 

as [0081 ] In the present synthesis methods, the sequential steps of first adding an amino acid residue X followed by 
the addition of an oligonucleotide sequence to the linker molecule requires an alternating parallel synthesis procedure 
to add amino acid residue X and then add a unit identfier nucleotide sequence Z that defines (codes for) that corre- 
sponding amino acid residua The library is built up by the repetition of this alternating parallel process after pooling and 
division of the reaction products as described herein. 

40 [0082] The only constraint for making an encoded Itorary is that there must be compatible chemistries between the 
two alternating syntheses procedures for adding an amino acid residue as compared to that for adding an oligonucle- 
otide sequence. 

[0083] The problem of synthesiser^ 

alternating polymers »e synthesized, and by the correct choice of methods for deprotection of one growing polymer 
45 selectively while the other growing polymer remains blocked, such as by the use of transiert protects 

use of protection groups removable under specTc deprotection chemistry Suitable compatible protection chemistries 

are descrtoed herein, and additional suitable cherristries are also des^ 

avaiaWefromSyncpsy^ 

W0ey& Sons (1991). 
so [0084] The synthesis of a Itorary having a 

(t) A tinker molecule is provided that has suitable means for operafveiy linking the first amino acid residue X<, and 
for operative^ Inking the first nucleotide sequence defining a unrt identifier nucleotide Z 1 whose sequence codes 
for (defines) the structure of X,. Preferably the linto has a means tor attachm^ 
55 allows for the synthesis to proceed in the solid phase 

Thus the provided linker molecule has a structure A'-B-C'. where A* represents a termini adapted tor reaction 
to operativery link an amino acid residue X in precursor form {X). and C represents a termini adapted for reaction 
to operatively link an identifier oligonucleotide Z in precursor form {Z). The termini A' and C* are protected by 
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respective blocking groups so that during operative linking reactions at one termini, the other termini is protected 
from reaction. 

(2) The linker molecule is then subjected to a first cycle of synthesis to add a building block at one termini. The order 
of synthesis is not generally important insofar as one may elect to add an amino acid residue X first to termini A', 

5 or add an identifier oligonucleotide Z first to termini C\ A first cycle involves the steps of deprotecting the termini of 
the Onker to which a building block is to be added and then acting the building block to the termini. Typically, the 
added builcfing block contains a blocking group at its free termini, i.&, the termini that win participate in an addition 
of the next building block of its type. The linker molecule is then subjected to a second cycle of synthesis to add a 
building block at the other (second) termini. A second cycle involves the steps of deprotecting the second termini 

10 of the linker to which a building block is to be added and then adding the building block to the termini. Again, the 
added builcfing block is typically blocked at its free termini. 

The adcfiticn of identifier oligonucleotide Z to termini G can be conducted either nucleotide by nucleotide to 
form the complete unit identifier nucleotide sequence Z, or Z can be presynthesized, and the oligonucleotide Z 
added as a block to termini C\ Insofar as the synthesis of oligonucleotides is well known in the arts, the presynthe- 

15 sis of oligonucleotides, and their addition to the growing nucleotide polymer in blocks is preferred because it 
reduces the number of manipulations in synthesizing a brfunctional molecule 

An amino add residue X or a unit identifier oligonucleotide Z is referred to as a precursor (X* or Z) to indicate 
that it contains a leaving group compatible with the reaction chemistry that facilitates the precursor's operative link- 
age to the growing polymer at the appropriate termini. 

20 The product resulting from step (2) is a bifunctional molecule having the structure A'-X 1 -B-Z r C, and is ready 

for a repetition of the above first and second cycles to add X2 and Z2 to the growing polymers. 

(3) After the bifunctional molecule product A*-X t •B-Z 1 -C* is formed, aliquots of the product are made, and the cycles 
in step (2) are repeated on each aliquot with the exception that a different species of X (and its corresponding Z) 
is added in each cfifferertt aliquot The reaction product in each aliquot has the structure A'-X 2 -X 1 -B-Z r Z 2 -C'. 

25 (4) The afiquots each containing the product A'-X r X r B-Z 1 -Z 2 -C are combined (pooled) to form a mixture of differ- 
ent bifunctional molecules, and the mixture is divided into aliquots. The cycles in step (2) are repeated again on 
each aliquot with (Efferent X and Z buikfing blocks being added to each aliquot to form the bifunctional molecule 
product A-*X3-X2-Xt-B-Z«|-Z2-Z3-C\ 

30 [0085] The process of pooling, aliquoting and adding a next set of buikfing blocks X and Z are then repeated at posi- 
tions n=4, 5,6... and so on depending on the length of polymers desired. As the cycles are repeated, and the polymers 
grow in length, the complexity of the resulting Ifcrary also increases. For each cycle, the polymer length a increases by 
one and the lixary complexity therefore increases exponentionalty according to the formula V*. In preferred embodi- 
ments, the cycles are repeated from about 1 to 10 times. 

35 [0086] tn a related ernbcxfiment the provided linker in step (1) is first divided into aliquots. and the cycles of step (2) 
are conducted on each aliquot adding a different X and corresponding Z to the linker in each different aliquot The aliq- 
uots are then pooled as before, and the cycles of step (2) can be repeated on one or more aliquots. 
[0067] Thus the steps of (i) tfviding a linker or pool into aliquots, (ii) parallel addition of X and Z to the linker sub- 
strate in separate afiquots. and (Hi) pooling of the aliquots, can be cycled (repeated) to sequentiafly add the chemical 

40 units and their corresponding unit identifier oligonucleotides to form the Iforary cornphsing a plurality of bifunctional mol- 
ecules each having a different chemical polymer operativery linked through the linker to a corresponding identifier ofi- 
gonucteotideL 

[0068] In a preferred embocfimerrt, a nrtethod for forming a Ovary of this invention includes the steps for adcfition of 
the PCR primer bmdmg sites P1 and P2 to each of the bifunctional rnoleodes in the ibrary. 

45 [0069] The nriethod is substantially the sam 

synthesized P1 oligonucleotide to the linker molecule provided in step (1 ) prior to the cycles of step (2) that add X and 
Z. Because afl member s of the forary are to contain the same P1 sequence, P1 is added to the C termini of inker mol- 
ecule A'-B-C prior to dwiolnQ tf>e !irtl» rntoafit^jots and subjecting the aliquots to the cycles of step (2) adding Xj and 
Z%. The resulting product has the formula A'-B-PI-C. 

so [00901 Ther e aft e r , the product is afiquot e d and cycled as before; resulting in the preparatfon of the p 
B-P1-(2y a -C\ where a indicates the presence of a polymer of tength "a". 
[0091] Next the pooled adnubrecort 

of nucleotides or a presynthesized oigonudeotide P2 at termini C to form the product A^XJa-B-P 1-{Z n ) a -P2-C. Thus 
aB members of the Gbrary contain a common sequence P1 and a common sequence P2 from which universal PCR 
55 reactions can be conducted, regardess of the species of brfunctional molecule present from the fixaiy 

[0092] In a related entxx ft 1 «nt the method for producing a B*ary of this invention utiBzes a bf-CPQ solid support 
as described herein 
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1. Polypeptide Libraries 

[0093] In one preferred embodiment, the invention contemplates a library, and methods of producing the library, 
where the Afunctional molecule has a polypeptide for polymer A. 

5 [0094] In this embodiment, the compatible chemistries for sequentially adding amino acids and oligonucleotides to 
the growing polymers has been developed fa the synthesis of an amino acid polymer in the direction of carboxy to 
amino terminus, and alternatively in the direction of amino to carboxy terminus. Chemistries have also been developed 
for the synthesis of an oligonucleotide polymer in the direction of 3' to 5*. and alternatively in the direction of 5' to 3*. In 
addition, in each of these syntheses it is preferred that the amino acid side chains (R groups) be blocked for certain 

io amino acid residues where the R group provides an otherwise reactive termini during one of the synthesis or deblocking 
steps. 

[0095] Each type of chemistry will be described in detail herein below. 

[0096] For any of the syntheses, the reactive side chains of several amino acids must be blocked. Table 1 below 
lists those of the natural amino acids which have an R group that preferably contains a blocking group. Any compatible 
75 protecting (blocking) group may be utilized, and the invention is not to be so limited to any particular blocking group. 
Also indicated in Table 1 are preferred blocking groups. 
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TABLE 1 



Amino Af^ri 
ml 1 ill iu nwu 


DlUVfUll^j VII UU^i 


niyu ill 10 


N-MTr 1 N-PMn 7 

1 n VI II , 1^1 Fnnw 


ruoUUJiio 




Cysteine 


S-Trt 3 , Acm 11 , S-t-butyl 


Tryptophan 


N'-CHO, none 


Tyrosine 


O-TBS 4 


Aspartic acid 


O-TSE 5 , DMB 12 


Glutamic acid 


O-TSE 5 DMB 


Serine 


O-TBS 4 


Threonine 


O-TBS 4 


Lysine 


N-Bz 6 TFA 8 TEOC 9 


Asna ran in a 


none 


VmMUIAJ 1 III lv 


none 


Glycine 


none 


Phenylalanine 


none 


Methionine 


none 


Alanine 


none 


Isdeudne 


none 


Leucine 


none 


Valine 


none 


Proline 


none 



1 MTr is N 9 -4-m^tra<y-2,3,6-trimethytoenzene suffo- 
nyt 

2 Bum is tert-butoxymethyl. 
^ Trt is tnphenyknethyl. 

5 TSE is trtmethylsyflethytester 

4 TBS is tertHDutyWimethyteilyiesier. 

6 Bz is benzyl 

7 PMC is N G -2 1 2^7.&i»rtamdhyk^omon-6- 
su^phonyl. 

8 TFAisWuoroacetyt. 

9 TEOC is fHtrimettiyl silyOethaxycarbonyl. 

10 SEM ts Htrimethyl slyOethoxymethyi. 

11 Acm is acetanudomelhyL 

12 DMB is (fimemocybenzyi 



[0097] Protected amino acids suitable as a blocked precursor for addition to a brfunctional molecule can be 
obtained from a variety of commercial vendors including Bachem Biosciences Inc. (Philadelphia, PA), Peninsula Labs 
(CA), and Nova Biochem(CA). In adcfitioa the preparation of pro t ec te d amino acids is described at least in Example 1 . 

a. Polypeptide Synthesis 

[0098] For synthesis of a polypeptide on the inker substrate in the direction of carbooty to amino terminus, a free 
ammo terminus on the linker is required that can be conveniently blocked and deblocked as needed. A preferred amino 
terminus blocking group is a ffejorefiytmethcxycaitonyl group (FMOC). 

[0099] FMOC blocked amino termini are deblocked using 20% (v/v) piperkfine in dirrrethytfarrarnide (DMF), or 
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using 1 ,8-diazatricyclo[5,4,0]undec-7-ene (DBU) in dichloromethane (DCM) as is well known for polypeptide synthesis. 
The amino acid units are added in the form of blocked amino acids having FMOC blocked amino termini and a carboxyl 
terminus blocked with pentafluorophenyi ester (Opfp) or by the activation of the free acid using BOP, HBTU, TBTU or 
preferebly with pyBOR BOP is ben2otriazole-1-yl-oxy-tris(dimethylamino)-phosphonium hexaftuorophosphate (Castro's 

5 reagent). HBTU is 2-(1H-benzotria20le-1-yO-1,1.3.3-tetramethyluronium hexafluorophosphate. TBTU is 2-(1H-benzot- 
riazole-1 -yl)- 1 , 1 ,3,3-tetrarnethyf uronium tetrafluoroborate. pyBOP is bensotriazoJ-1 -yloxytris (dimethylamino phospho- 
nium hexafluorophosphate. The addition reaction requires the blocked amino acid, dimethytfbrmarrtide (DMF) and 
hydroxy-benzotriazole (HOBt) as is well known for peptide synthesis. The resulting product contains an added amino 
acid residue with a FMOC-Wocked amino terminus, ready for deblocking addition of a subsequent blocked amino acid 

10 as before . 

[0100] For synthesis of a polypeptide on the linker substrate in the direction of amino to carboxy terminus, a free 
carboxy terminus on the linker is required that can be conveniently blocked and deblocked as needed. A preferred car- 
boxy terminus blocking/activating group is the Opfp ester described before. A carboxy terminus on the linker is pro- 
duced by reacting a linker with a free amino terminus with succtnamide in HOBT and a proton catalyst Thereafter, the 
is terminus can be modified by reaction with perrtaflwxophenol in dicydohexylcarbodumide (DCC) and ethand acetate to 
form an Opfp ester at the free carboxy terminus. The Opfp ester is blocked linker terminus is available for adcfition reac- 
tion with a FMOC-, Opfp- blocked amino acid as before, but with the amino acid adding to the linker in the reverse direc- 
tion. The resulting product contains an added amino acid residue with an Opfp-blocked terminus, ready to repeat the 
addition with a subsequent blocked amino acid. 

20 

b. QliqonugleQMe Synthesis 

[0101 ] Ofigonudeotides can be synthesized by a variety of chemistries as is well known An excellent review is "Oli- 
gonucleotide Synthesis: A Practical Approach", ed. MJ. Gait IRL Press, Oxford, (1984). Preferred oligonucleotide syn- 
25 thesis can be carried out on any of a variety of automated DNA synthesizing machines such as are available from 
Applied Biosystems Inc., (ABI; Foster City, CA). Additional excellent references for oligonucleotide synthesis and DNA 
synthesis machines and technology include Caarthers, Science, 230281-285 (1985); and Caruthers, Acc. Chem. Res.. 
24278-284(1991). 

[0102] For synthesis of an oligonucleotide on the linker substrate in the direction of 3' to 5\ a free hydroxy terminus 
30 on the linker is required that can be conveniently blocked and deblocked as needed. A preferred hydroxy terminus 
blocking group is a dimethoxytrityl ether (DMT). DMT blocked termini are first deblocked, such as by treatment with 3% 
dichloroacetic acid in dichloromethane (DCM) as is weD known for oligonucleotide synthesis, to form a free hydroxy ter- 
minus. 

[0103] Nucleotides in precursor form for addition to a free hydroxy terminus in the direction of 3* to 5* require a phos- ~~ 
35 phoramidrte moiety having an dasopropyl phosphorarrtidite at the 3* terminus of a nucleotide In adcfition, the oxygen of 
the phosphoramidite is blocked with a cyanoedhyi group (CNE), and the 5* terminus is blocked with a DMT ether. 
[0104] The addition of a 5 DMT-. 3' CNE- blocked nucleoside phosphoramidite to a free hydroxy! requires tetrazde 
in acetonitrile followed by (iodine) oxidation and capping of unreacted hydroxyts with acetic anhydride, as is well known 
for oligonucleotide synthesis. The resulting product contains an added nucleotide residue with a DMT blocked 5* termi- 
40 nus, ready for deblocking and addrtkxi of a subsequent blocked nucleotide as befora 

[0105] For synthesis of an oigonucteotkte on the (inter in the direction of 5* to 3\ a free hydroxy terminus on the 
linker is required as befora However, the blocked nucleotide to be added has the blocking chemistries reversed on its 
5* and 3' termini to facState addition in the opposite orientation. 

[0106] A nucleotide with a free 3' hydroxy! and 5* DMT ether is first blocked at the 3* hydroxy terminus by reaction 
45 with TBS-G in imidazole to form a TBS ester at the 3' terminusi Then the DMT- blocked 5* terminus is deblocked with 
DCA in DCM as before to form a free S hydroxy terminus. The reagent (N,NHjiisopropylarrTO)(cyarx)e^ phcKphon- 
amkfic chloride is reacted in tefcrahydrofuran (THF) with the 5* deblocked nucleotide to form the arrtnodiisopropyK CNE- 
btocked phosphorarnkfite group on the 5* terminus. Thereafter the 3' TBS ester is removed with tetrabutylammontum 
fluoride (TBAF) in DCM to form a nucleotide with the phosphoramidrte-blocked 5 terminus and a free 3' hydroxy terrrt- 
so nua Reaction in base with DMT-C1 adds a DMT ether blocking group to the 3' hydroxy terminus. 

[0107] The adcfition of the 3* DMF, 5* CNE- blocked phosphoramidrted nucleotide to a finter substrate having a free 
hydroxy terminus fren proceeds using the previous tetrazde catalyzed reaction, as is well known for oligonucleotide 
synthesis. The resulting product contains an added nucleotide residue with a DMT-btocfeed 3' terminus, ready for 
deblocking with DCA in DCM and the adcfition of a subsequent blocked nucleotide as before 
55 [0108] The above demonstrates that the present brfunctional molecules can be synthesized having polypeptide 
(X) a in either orientation and having the polymer identifier oligonucleotide (Z) a in either orientation. Exemplary is the 
synthesis described herein in detail to form a lixary of bHunctional molecules having the oligonucleotide attached to 
linker through rts 3' terminus and having the peptide attached to linker through its carboxy terminus. 
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[0109] In one preferred embodiment, the order of synthesis orients the polypeptide on the linker such that after 
addition an added amino acid has a free amino terminus, that is, the polymer is assembled in the direction from carboxy 
to amino terminus. Exemplary chemistry for this synthesis is described in the Examples. 

[0110] The addition of oligonucleotides rather than single nucleotides to the growing polymer identifier nucleotide 
sequence is an alternate embodiment that is preferred because it affords more rapid and modular assembly of the 
library. Although the previous synthesis discussions involved single nucleotide base units, the same blocking groups 
and addition chemistries apply where an oligonucleotide is to be added. 

[0111] The synthesis of a oligonucleotide having 5' CNE-btocked and 3* DMT-blocked termini or having 3* CNE- 
Wocked and 5' DMTWockBd termini can readSy be prepared using the ofigonucieotide synthesis methods presently 
available and described herein. 

[0112] After synthesis of a brfunctional molecule, or Itorary of molecules, the blocking groups at termini and at 
amino acid side chains are removed. Because of the relative lability of terrnini. it is preferred that the order of deblocking 
be selected as to preserve the functionalities, particularly the side chain functionalities. 

[01 13] In the present preferred embodiment for a polypeptide Itorary, the following sequence of deprotecting is pre- 
ferred when using the teflon support and 5 BCM3 linker described in Examples 2-3: 

1) tetrabutyl ammonium fluoride (TBAF) treatment to remove TBS and TMS ethyl ethers; 

2) brief 5 minute treatment with trifluoroacetic acid (TFA) treatment sufficient to remove MTr, Bum, PMC and Trt 
groups, followed by neutralization for 5 minutes with triethylamine; 

3) aqueous ammonia treatments to remove Bz and CNE groups; and 

4) cleavage of the brfunctional molecule from the solid support using a periodate oxidation. 

[0114] Alternatively, the following sequence of deprotection is used on the CPG linker described in Example 3 
either with or without the photoactivety cleavable linker moiety: 

1) TBAF treatment to remove TBS and TMS ethyl ethers; and 

2) aqueous ammonia treatment to remove Bz and CNE groups. 

[01 1 5] As indicated, after the Itorary has been synthesized, and after the protecting groups have been removed, the 
bifunctional molecules may be cleaved off of the solid support, and the released bifunctional molecules separated from 
the sofid phase to form a solution comprising a plurality of bifunctional molecules. Alternatively, the library may be main- 
tained in the form of a plurality of btfunctional molecules in the solid phase. 

[0116] Although natural amino acids are used in the Examples, the present invention is not to be so limited. The 
alphabet of possible amino acid residues can be extended to include any molecule that satisfies the basic chemistry 
defining an amino add, namely cartxxxyl and amino termini. Upon polymerization, an amide bond is formed. Thus the 
possible amino acids can arciude L-arrino acids, D-arrino adds, natural amino acids, non-natural amino acids, and 
derivatives thereof, including pharmaceutical^ active molecules. 

[0117] In addrtion, there is no basis to limit the polypeptide backbone connecting the termini to the conventional 
amino acid structure The amino and carbcxyltc acid moieties can be on any backbone having any side group substit- 
utes, so long as the side groups are properly blocked as described herein. Previously undeserved amino acids may 
be developed that can be used in the present invention, having heteroatomic configurations, including unusual hetero- 
cyclic rings, such as thiazole-aiartirie or purine alanine 

[0118] The development and use of both conventional and unusual amino acid structures provide a greater <Sver- 
sity of chemical moieties for a Itorary of this invention. Such Bbraries allow the exploration by the screening methods of 
this invention of new combinations of important core chemical structures. 

[0119] Typical backbones can be alMchairo In adefition. the alphabet 

can comprise amino acids of varying baddwnestnxturessu^ 

amino acids where the number of carbon atoms and their configuration in the backbone can be varied, 
a Methods for Identify^ 

[01201 The Itorary of this invention provides a repertoire of chemical drversity such that each chemical moiety is 
linked to a genetic tag that feeffitates identification of the chemical structure. 
[0121] By the presemsgeenirig methods, 

interactions or chemical catalysis events wflh a biotogkafy active molecule by drawing upon a repertoire of structures 
randomly formed by the oa n to i i iah ji ia l assoc iat ion erf <fiw 
them one at a time or trowing their interact*** in advance 

[0122] The invention therefore also contemplates a method for identifying a chemical structure that participates in 
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a preselected binding or catalysis interactions between the chemical structure and a biologically active molecule. The 
chemical structure to be identified is represented by one of the members of a library of this invention, and the method 
comprises the following steps: 

5 (i) A library according to the present invention is admixed with a preselected biologically active molecule under 
binding conditions (i.e., a binding reaction admixture) for a time period sufficient for the biologically active molecule 
to interact with at least one bifunctional molecule of this invention present in the library and form a binding reaction 
complex. 

(2) The binding reaction complex is then isolated from the library admixture to form an isolated complex. 
w (3) The nucleotide sequence of the polymer identifier oligonucleotide present in the isolated binding reaction com- 
plex is determined. The nucleotide sequence provides a code that defines the chemical structure that participated 
in the binding reaction, and thus determining that sequence identifies the chemical structure that participates in the 
binding reaction with the biologically active molecule. 

is [0123] Where catalysis is the activity to be identified, a reactant can be identified indicating the occurrence of a cat- 
alytic event and the causative solid support in the library is selected as the cancfidate catalytic mdecula 
[0124] A typical biologically active molecule exhtoiting a preselected binding interaction or catalytic reaction can be 
any of a variety of molecules that bind selectively to and/or react with another molecule, including antibodies to anti- 
gens, lectins to oligosaccharides, receptors to ligands, enzymes to substrates and the like mediators of molecular inter- 

20 actions and can be catalytic molecules like proteases. Therefore, a preselected binding interaction is defined by the 
selection of the biologically active molecule with which a library member is to bind. Similarly, a preselected catalytic 
activity is defined by selection of a substance with which a library member is catalyticalty active. The binding reaction 
will be discussed as exemplary, although it is understood that the procedures can be readily adapted for use to detect 
catalytic polymers. 

25 

1. Birring Reaction Admixtures 

[0125] The admixture of a library of the invention with a biologically active molecule can be in the form of a hetero- 
geneous or homogeneous admixture. Thus, the members of the library can be in the solid phase with the biologically ~, 
30 active molecule present in the liquid phase. Alternatively, the biologically active molecule can be in the solid phase with 
the members of the fibrary present in the liquid phase. StiH further, both the library members and the biologically active _ 
mdecuJe can be in the liquid phase. 

[0126] Binding conditions are those conditions compatible with the known natural binding function of the biologi- u 
cally active molecule. Those compatible condtions are buffer, pH and temperature conditions that maintain the biolog- , 
35 teal activity of the biologically active molecule, thereby maintaining the ability of the molecule to participate in its < 
preselected binding interaction. Typically those conditions include an aqueous, physiologic solution of pH and ionic 
strength normally associated with the biologically active molecule of interest 

[0127] For example, where the binding interaction is to identify a member in the library able to bind an antibody mol- 
ecule, the preferred binding conditions would be conditions suitable for the antibody to irnmunoreact with its irnmuno- 
40 gen. or a known immunoreacting antigen. For a receptor molecule, the binding conditions would be those compatftxe 
with measuring receptoMtgand interactions. 

[0128] A time period sufficient for the admixture to form a binding reaction complex is typically that length of time 
required for the biologically active molecule to interact with its normal binding partner under condffions compatible with 
interaction. Although the time periods can vary depending on the molecule, admixing times are typically for at least a 
45 few minutes, and usually not longer than several hours, although nothing is to preclude using longer admixing times for 
a binding reaction complex to form. 

[0129] A binding reaction complex is a stable product of the interaction between a biologically active molecule and 
a bifunctional molecule of this invention. The product is referred to as a stable product in that the interaction is main- 
tained over sufficient time that the a)mptex can be isolated from the rest of them 
so plex becoming significantly disassociated. 

2. isolation of a Stoncfonal MdecuJe from the Binding Reaction Admatofi 

[0130] A bindmg reaction complex is isolated from fre binding reaction admixture by any separation means that is 
55 selective for the complex, thereby isolating that species of bifunctional molecule which has bound to the biotogicaMy 
active mdecula There are a variety of separation means, depending on the status of the biologically active mdecula 
[0131] For example, the biologicaly active molecule can be provided in admixture in the form of a solid phase rea- 
gent i.a. affixed to a solid support and thus can readily be separated from the liquid phase, thereby removing the 
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majority of species of afunctional molecule. Separation of the solid phase from the binding reaction admixture can 
optionally be accompanied by washes of the solid support to rinse Afunctional molecules having lower binding affinities 
off of the solid support. 

[01 32] Alternatively, for a homogeneous liquid binding reaction admixture, a secondary binding means specific for 
5 the biologically active molecule can be utilized to bind the molecule and provide for its separation from the binding reac- 
tion admixture. 

[0133] For example, an immoWized anttoody immunospedfic for the biologically active molecule can be provided 
as a solid phase-affixed antibody to the binding reaction admixture after the birxfing reaction complex is formed. The 
immobi&zed antfoody immunoreacts with the biologically active molecule present in the birring reaction admixture to 
10 form an antftxxfy-biologically active molecule imrnunoreaction complex. Thereafter, by separation of the solid phase 
from the binding reaction admixture, the imrnunoreaction complex, and therefor any binding reaction complex, is sepa- 
rated from the admixture to form isolated brfunctionai moiecula 

[0134] Alternatively, a binding means can be operative^ linked to the biologically active molecule to fiactftate its 
retrieval from the binding reaction admixture. Exemplary binding means are one of the following high affinity pairs: 
is biotirvavidin, protein A-Fc receptor, ferntm-rnagnetic beads, and the fika Thus, the biologically active molecule is oper- 
ative^ linked (conjugated) to biotin, protein A, ferritin and the like bincfing means, and the binding reaction complex is 
isolated by the use of the corresponding binding partner in the solid phase, e.g., solid-phase avidin, solid-phase Fc 
receptor, solid phase magnetic beads and the like. 

[0135] The use of solid supports on which to operativety Dnk proteinaceous molecules is generally well known in 
20 the art Useful solid support matrices are well known in the art and include cross-linked dextran such as that available 
under the tradename SEPHADEX from Pharmacia Rne Chemicals (Piscataway, NJ); agarose, borosilicate, polystyrene 
.... or latex beads about 1 micron to about 5 millimeters in diameter, polyvinyl chloride, polystyrene, cross-linked polyacry- 
lamide, nitrocellulose or nylon-based webs such as sheets, strips, paddles, plates microliter plate wells and the like 
insoluble matrices. 

25 

3. Dflermininq the Identifier Sequence 

[0136] The nucleotide sequence of the identifier oligonucleotide present in the isolated bifunctionat molecules is 
determined to identify the species of chemical moiety that participated in the preselected binding interaction or catalytic 
30 reaction. 

[01 37] Although conventional DN A sequencing methods are readily available and useful for this determination, the 
amount and quality of isolated brfunctionai molecule may require additional manipulations prior to a sequencing reac- 
tion. 

[0138] Where the amount is low, it s preferred to increase the amount of the identifier o&gonucleotide by polymer- 

35 ase chain reaction (PCR) using PCR primers directed to the primers P1 and P2 present in the identifier oligonucleotide. 
[0139] In addition, the quality of the isolated Afunctional molecule may be such that multiple species of brfunctionai 
molecule are co-isolated by virtue of similar capacities for birring to the biologically active moiecula In cases where 
more than one species of bifunctional molecule are isolated, the different isolated species must be separated prior to 
sequencing of the identifier oligonucleotide. 

40 [01401 Thus in one embodiment the i fi ffe f wi l identifier oligonucleotides of the isolated bifunctional molecules are 
cloned into separate sequencing vectors prior to determining their sequence by DNA sequencing methods. This is typ- 
ically accomplished by ampffying all of the different identifier oligonucleotides by PCR as described herein, and then 
using the unique restriction endonudease sites on the amplified product as shown in Figure 1 to directionally clone the 
arnptified fragments into sequencing vectors. The cloning and sequencing of the amplified fragments then is a routine 

45 procedure that can be carried out by any of a number of molecular biolo g ica l methods known in the art 

[0141] Alternatively, PCR amplified products derived from a population of isolated brfunctionai molecules can be 
used as a rtybrkfization probe to selective using 
the hybridization probes; which are mooWied by biotinytation as shown in Figure 1, one can isolate members of the 
Itorary by hybridization, to form an enriched I bran/ containing only bif u nctio nal molecules that have sequences that 

so hybridize to the above hybr^ In a second screening reaction under different bending conditions, for exam- 

pie. higher stringency binding condrtion s , one can isolate the species of Afunctional molecule that binds most tightly 
with the btotogicaly active molecule. 
[0142] Thus the Ebr^ can rje manipulated to form 

55 4. Polymerase Cham Reaction 

[0143] For determining the nucleotide sequence of the identifier oigonudeotide in the isolated complex as part of 
the methods of this invention, the use of the polymerase chain reaction (PCR) is a preferred embocfiment 
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[0144] For use in this invention, the identifier oligonucleotide are comprised of polynucleotide coding strands, such 
as mRNA and/or the sense strand of genomic DNA. If the genetic material to be assayed is in the form of double 
stranded DNA, it is usually first denatured, typically by melting, into single strands. The nucleic acid is subjected to a 
PGR reaction by treating (contacting) the sample with a PCR primer pair, each member of the pair having a preselected 

5 nucleotide sequence. The PCR primer pair is capable of initiating primer extension reactions by hybridizing to the PCR 
primer binding site on nucleotide sequences of the identifier nucleotide, preferably at least about 10 nucleotides in 
length, and more preferably is at least about 20 nucleotides in length. The first primer of a PCR primer pair is sometimes 
referred to herein as the "anti-sense primer" because it hybridizes to a non-coding or anti-sense strand of a nucleic acid, 
i.e., a strand complementary to a coding strand. The second primer of a PCR primer pair is sometimes referred to - 

w herein as the "sense primer" because rt hybridizes to the coding or sense strand of a nucleic acid. 

[0i45] The PCR reaction is performed by mixing the PCR primer pair, preferably a predetermined amount thereof, 
with the nucleic acids of the sample, preferably a predetermined amount thereof, in a PCR buffer to form a PCR reaction 
admixture. The admixture is thermocycted for a number of cycles, which is typically predetermined, sufficient for the for- 
mation of a PCR reaction product thereby enriching the sample to be assayed for the identifier oligonucleotide in the 

75 isolated complex. 

[0146] PCR is typically carried out by thermocycling i.e.. repeatedly increasing and decreasing the temperature of 
a PCR reaction admixture within a temperature range whose lower Gmit is about 30 degrees Celsius (30°C) to about 
55°C and whose upper limit is about 90°C to about 100°C. The increasing and decreasing can be continuous, but is 
preferably phasic with time periods of relative temperature stability at each of temperatures favoring polynucleotide syn- 

20 thesis, denaturation and hybridization. 

[0147] A plurality of f irst primer and/or a plurality of second primers can be used in each amplification, e.g., one spe- 
cies of first primer can be paired with a number of different second primers to form several different primer pairs. Alter- 
natively, an individual pair of first and second primers can be used In any case, the amplification products of 
amplifications using the same or different combinations of first and second primers can be combined for assaying for 

25 mutations. 

[0148] The PCR reaction is performed using any suitable method. Generally it occurs in a buffered aqueous solu- 
tion, i.e., a PCR buffer, preferably at a pH of 7-9. most preferably about 8. Preferably, a molar excess of the primer is 
admixed to the buffer containing the template strand. A large molar excess is preferred to improve the efficiency of the js. 
process. 

30 [0149] The PCR buffer also contains the deoxyribonudeotide triphosphates (polynucleotide synthesis substrates) 
dATP, dCTP. dGTP, and dTTP and a polymerase, typically thermostable, all in adequate amounts for primer extension ^ 
(polynucleotide synthesis) reaction. The resulting solution (PCR admixture) is heated to about 90°C • 100°C for about - 
1 to 1 0 minutes, preferably from 1 to 4 minutes. After this heating period the solution is allowed to cool to 54°C, which 
is preferable for primer hybridization. The synthesis reaction may occur at from room temperature up to a temperature v, 

35 above which the polymerase pnducing agent) no longer functions efficiently. Thus, for example, if DNA polymerase is - 
used as inducing agent the temperature is generafiy no greater than about 40°C. The thermocycling is repeated until 
the desired amount of PCR product is produced An exemplary PCR buffer comprises the following: 50 mM KCt; 1 0 mM 
Tris-HCI at pH 8.3; 1 .5 mM MgCfe; 0.001% (wt/vof) gelatin, 200 \lM dATP; 200 jlM dTTP; 200 jiM dCTP; 200 |iM dGTP; 
and 2.5 units Thermus aquaticus (Taq) DNA polymerase I (U.S. Patent Na 4.889,818) per 100 microliters Gil) of buffer. 

40 [0150] The inducing agent may be any compound or system which will function to accomplish the synthesis of 
primer extension products, including enzymes. Suitable enzymes for this purpose include, for example, E. opli DNA 
polymerase I, Klenow fragment of E. coi DNA polymerase I, T4 DNA polymerase; other available DNA rxrymerases, 
rewerse transcriptase, and other enzyme nucle- 
otides in the proper manner to form the primer extension products which are complementary to each nucleic acid 

45 strand. Generally, the synthesis will be initiated at the 3* end of each primer and proceed in the 5* direction along the 
template strand, unti synthesis terminates, producing molecules of different lengths. There may be inducing agents, 
however, which initiate synthesis at the 5* end and proceed in the above d^ 
abova 

[0151 ] The inducing agent also may be a cornpound or system which will function to accomplish the synthesis of 
so RNA primer extension products, including enzymes. In preferred ernbodiments, the inducing agent may be a DNA- 
dependent RNA polymerase such as T7 RNA polymerase; T3 RNA polymerase or SP6 RNA polymerase These 
potyrrerases produce a conpler^ 

the starting polynucleotide as has been described by Chamberlin et aL, The Enzymes, ed. P. Beyer, pp. 87-1 08, Aca- 
demic Press, New York (1982). Amplification systems based on tr an scription have been described by Gingeras et al.. 
55 in PCR Protocols. A Guide to Methods and Apo fc afogL pp 245-252. Innis et at. eds. Academic Press. Inc., San 
Diego, CA(1990). 

[0152] If the inducing agent is a DNA-dependent RNA polymerase and. therefore ■lcorpora tes rfoonudeotide tri- 
phosphates, sufficient amounts of ATP. CTP, GTP and UTP are admixed to the primer extension reaction admixture and 
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the resulting solution is treated as described above. 

[0153] The newly synthesized strand and its complementary nucleic acid strand form a double-stranded molecule 
which can be used in the succeeding steps of the process. 

[0154] PCR amplification methods are described in detail in U.S. Patent Nos. 4,683,192, 4,683.202, 4,800,159. and 
4,965,188, and at least in several texts including PCR Technology: Principles and Applications for DNA Amplification. 
H. Ertich, ed., Stockton Press, New York (1989); and PCR Protocols: A Guide to Methods and Applications. Innis et a)., 
eds., Academic Press, San Diego, California (1990). 

[01 55J The term "polynucleotide" as used herein in reference to primers, probes and nucleic add fragments or seg- 
ments to be synthesized by primer extension is defined as a molecule comprised of two or more deoxyribonudeotides 
or ribonucleotides, preferably more than three. Its exact size will depend on many factors, which in turn depends on the 
ultimate conditions of use. 

[0156] The term "primer" as used herein refers to a polynucleotide whether purified from a nucleic add restriction 
digest or produced synthetically, which is capable of acting as a point of initiation of nucleic acid synthesis when placed 
under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is 
induced, i.a, in the presence of nudeotides and an agent for polymerization such as DNA polymerase, reverse tran- 
scriptase and the like, and at a suitable temperature and pH. The primer is preferably single stranded for maximum effi- 
ciency, but may alternatively be in double stranded form. If double stranded, the primer is first treated to separate it from 
its complementary strand before being used to prepare extension products. Preferably, the primer is a polydeoxyrbo- 
nudeotide. The primer must be suffidentty long to prime the synthesis of extension products in the presence of the 
agents for polymerization. The exact lengths of the primers will depend on many factors, inducting temperature and the 
source of primer. For example, depending on the complexity of the target sequence, a poiynudeotide primer typically 
contains 15 to 25 or more nudeotides, although it can contain fewer nudeotides. Short primer mdecules generally 
require cooler temperatures to form suffidently stable hybrid complexes with template. 

[0157] The primers used herein are selected to be 'substantially" complementary to the different strands of each 
specific sequence to be synthesized or amplified. This means that the primer must be sufficiently complementary to 
non-randomty hybridize with its respective template strand. Therefore, the primer sequence may or may not reflect the 
exact sequence of theterrplata For example, a non^xxriplemerrtary rtudeotide fragment can be attached to the 5' end 
of the primer, witfi the remainder of the primer sequence being substantially complementary to the strand. Such non- 
conrplemerttary fragments typically code for an endonudease restriction sita Alternatively, non-complementary bases 
or longer sequences can be interspersed into the primer, provided the primer sequence has suff tdent cornplementarity 
with the sequence of the strand to be synthesized or ampified to non-randornfy hybrkfize therewith and thereby form an 
extension product under poiynudeotide synthesizing concfitions. 

[0158] Primers of the present invention may also contain a DNA-dependent RNA polymerase promoter sequence 
or its cornplement See for example, Krieg et a)., Nuct Acids Res.. 12:7057-70 (1984); Studier et ai., J. Moi. Biol.. 
189:113-130 (1986); and Molecular Ctoning; A laboratory Martini frmyt Rlffion Maniatis et ah, eds.. Cold Spring 

Harbor, NY (1989). 

[0159] When a primer containing a DNA-dependent RNA rx)tyrnerase promoter is used, the primer is hybridized to 
the pofyrnxteotide strand to bear RNA pdymerase 

promoter is completed using an indudng agent such as E_fi* DNA pdymerase I, or the Klenow fragment of E. cdi 
DNA pdymerase. The starting poiynudeotide is amplified by alternating between the production of an RNA poiynude- 
otide and DNA poiynudeotide. 
[0160] Primercm^ also contain a terr^^ 

asa Typical RNA-drected RNA polymerase include the QB repficase descrtoed bv Lizardi et aL Bjotechnotofly. 6:1 197- 
1 202 (1 988). RNA-durected polymerases produce large numbers of RNA strands from a small number of template RNA 
strands that contain a template sequence or replication initiation site. These polymerases typically give a one miflion- 
fakf arryxiflca^ofthetan^x^stnrt as has been described by Kramer et aL. J. Mol. Biol.. 89:719-736 (1974). 
[0161] The poiynudeotide primers can be prepared using any suitable method, such as, for example, the phospho- 
triester or phosphodiester methods see Narang et aL, Meti. EnzvmoL. 68:90. (1979); U.S. Patents No. 4.356,270, No. 
4,458,066. No. 4.416.988, No. 4,293,652; and Brown etaL Mettt EnzvmoL 68:109. (1979). 
[0162] » the nucleic acid sample is to be enriched for the identifier oigonudeofdeirt the iscxa^ 
arnplrHcation. two primers. Leu a PCR primer pair, must be used for each coding strand of nucl eic acid to be amplfied. 
The first primer becomes part of tte non-coding (anti-Gense or minus or corr^emerttary) strand and hybridizes to a 
rtudeotide sequence on the plus or cooing strand Gocond primers become part of the cooing (sense or plus) strand 
and hybridize to a nucleotide sequence on the minus or norwxxSng strand. One or both of the first and second primers 
can contain a rtudeotide sequonco defining an eiwlonurionso recognition site as described herein. The site can be het- 
erologous to the polymer identifier oligonucleotide being amptified. 

[01631 In one embooiment the present invention utifizes a set of porynudectides that form primers having a priming 
region located at the 3Merminus of the primer. The priming region is typically the 3 ? -most (3'-terminaJ) 1 5 to 30 nude- 
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otide bases. The 3'-terminal priming portion of each primer is capable of acting as a primer to catalyze nucleic acid syn- 
thesis, i.e., initiate a primer extension reaction off its 3' terminus. One or both of the primers can additionally contain a 
^-terminal (S'-most) non-priming portion, i.e., a region that does not participate in hybridization to the preferred tem- 
plate. 

5. Nucleic Acid Sequence Analysis 

[0164] Nucleic acid sequence analysis is a well known procedure tor determining the sequence of nucleotides and 
is applied to the present methods to determine the nucleotide sequence in an identifier oligonucleotide or PCR amplifi- 
cation product of this invention. Nucleic acid sequence analysis is approached by a combination of (a) physiochemtcal 
techniques, based on the hybridization or denaturation of a probe strand plus its complementary target and (b) enzy- 
matic reactions with endonudeases, ligases; and polymerases. 

[0165] In assays using nucleic acid hybridization, detecting the presence of a DNA duplex in a process of the 
present invention can be accomplished by a variety of means. 

[0166] In one approach tor detecting the presence of a DNA duplex, an oligonucleotide that is hybridized in the DNA 
duplex includes a label or indicating group that wll render the duplex detectable. Typically such labels include radioac- 
tive atoms, chemically modified nucleotide bases, and the like. 

[0167] The oligonucleotide can be labeled, i.e., operatively linked to an indicating means or group, and used to 
detect the presence of a specific nucleotide sequence in a target template. 

[0168] Radioactive elements operatively linked to or present as part of an oligonucleotide probe (labeled oligonu- 
cleotide) provide a useful means to facilitate the detection of a DNA duplex. A typical radioactive element is one that 
produces beta ray emissions. Elements that emit beta rays, such as 3 H 14 C, 32 P and 35 S represent a class of beta ray 
emission-producing radioactive element labels. A radioactive polynucleotide probe is typically prepared by enzymatic 
incorporation of radioactively labeled nucleotides into a nucleic acid using DNA kinase. 

[0169] Alternatives to radioactively labeled oligonucleotides are oligonucleotides that are chemically modified to 
contain metal compiexing agents, biotin-containing groups, fluorescent compounds, and the like. 
[0170] One useful metal compiexing agent is a lanthanide chelate formed by a lanthanide and an aromatic beta- 
diketone, the lanthanide being bound to the nucleic acid or oligonucleotide via a chelate-forming compound such as an 
EDTA-analogue so that a fluorescent lanthanide complex is formed. See U.S. Patent No. 4,374. 1 20, No. 4,569,790 and 
published Patent Application Noa EP0 139675 and WO87/02708. 

[0171] Bictin or acridine ester-labeled oligonucleotides and their use to label polynucleotides have been described. 
See U.S. Patent No. 4,707,404, published Patent Application EP0212951 and European Patent No. 0087636. Useful 
fluorescent marker compounds include fluorescein, rhodamine, Texas Red, NBD and the like. 
[0172] A labeled oligonucleotide present in a DNA duplex renders the duplex itself labeled and therefore distin- 
guishable over other nucleic acids present in a sample to be assayed. Detecting the presence of the label in the duplex 
and thereby the presence of the duplex, typically involves separating the DNA duplex from any labeled oligonucleotide 
probe that is not hybridized to a DNA duplex. 

[0173] Techniques for the separation of single stranded oligonucleotide, such as non-hybridized labeled oligonucle- 
otide probe, from DNA duplex are well taown, and typically involve the separation of single stranded from double 
stranded nucleic acids on the basis of their chemical propertiea More often separation involve the use of a 

heterogeneous hybridization format in which the norWiybridized probe is separated, typically by washing, from the DNA 
duplex that is bound to an ireolutte matrix. Exenplary is to in which the matrix is a nrtrocei- 

lufose sheet and the label is 32 P. Southern, J. Mol Biol. 98:503 (1975). 

[0174] The oligonucleotides can also be advantageously linked, typically at or near their SMerminus, to a sofid 
matrix, i.e.. aqueous insoluble solid support as previously described. 

[0i75] It is also possible to add "finking" nucleotides to the 5* or 3' end of the member ofigonucteotide, and use the 
linking oligonucleotide to operatively link the member to the solid support 

[0176] In nucleotide hybridizing assays, the hybridaatfon reaction mixture is maintained in the contemplated 
method under hybriolzing conrJtions for a time period sufficient for the oligonucleotides having complementarity to the 
predetermined sequence on the template to hybridce to complementary nucleic acid sequences present in the tem- 
plate to form a hybridization product i-&, a cornptex containing oligonucleotide and target nucleic acid. 
[01 77] The phrase Tiybridizing conditions" and its gr ammatica l equivalents, when used with a maintenance time 
period, indicates subjecting the hybridization reaction admixture, in the context of the coricentrations of reactants and 
accompanying reagerits in to 

nucleotides to anneal with the target sequence, to form a nucleic acid duplex. Such time; temperature and pH condi- 
tions requred to accon^plishrty^^ 

riybrirJzed, the degree of cornplemerttority between the oligonucleotide and the target the guanine and cytosine con- 
tent of the oligonucleotide, the stringency of hybridization desired, and the presence of salts or additional reagents in 
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the hybridization reaction admixture as may affect the kinetics of hybridization. Methods for optimizing hybridization 
conditions for a given hybridization reaction admixture are well known in the art 

(0178] Typical hybridizing conditions include the use of solutions buffered to pH values between 4 and 9, and are 
carried out at temperatures from 4°C to 37°C, preferably about 12°C to about 30°C, more preferably about 22°C, and 
tor time periods from 0.5 seconds to 24 hours, preferably 2 minutes (min) to 1 hour. Exemplary are the conditions 
described in Example 4. 

[0179] Hybridization can be carried out in a homogeneous or heterogeneous format as is well known. The homo- 
geneous hybricization reaction occurs entirely in solution, in which both the oligonucleotide and the nucleic acid 
sequences to be hybridized (target) are present in soluble forms in solution. A heterogeneous reaction involves the use 
of a matrix that is insoluble in the reaction medium to which either the oligonucleotide, polynucleotide probe or target 
nudeic acid is bound. 

[0180] Where the nucleic acid containing a target sequence is in a double stranded (ds) form, it is preferred to first 
denature the dsONA, as by heating or alkali treatment prior to conducting the hybridization reaction. The denaturation 
of the dsDNA can be carried out prior to admixture with a oligonucleotide to be hybridized, or can be carried out after 
the admixture of the dsONA with the oligonucleotide. 

[0181] Effective amounts of the oligonucleotide present in the hybridization reaction admixture are generally well 
known and are typically expressed in terms of molar ratios between the oligonucleotide to be hybridized and the tem- 
plate. Preferred ratios are hybridization reaction mixtures containing equimotar amounts of the target sequence and the 
oligonucleotide. As is well known, deviations from equal molarity will produce hybridization reaction products, although 
at lower efficiency. Thus, although ratios where one component can be in as much as 100 fold molar excess relative to 
the other component, excesses of less than 50 fold, preferably less than 1 0 fold, and more preferably less than two fold 
are desirable in practicing the invention. 

Examples 

[0182] The following examples are intended to illustrate, but not limit the present invention. 
1 Preparation of Protected Amino Acids 

[0183] The synthesis of a brfunctional molecule requires protected amino acids. The amino-terminus of the amino 
acid is protected with f luorenylmethacycarbonyl (FMOC) and the carboxy-terminus is protected with a perrtaf luorophe- 
nyl ester (Opfp). The amino acids lysine, cysteine, tyrosine, serine, threonine, arginine, histicfine, tryptophan, aspartic 
acid, and glutamic acid, require additional protection of their side chains (R groups). 

[0184] Most of the FMOC and Opfp protected amino acids are corrtrnercialty available and were obtained from 
Bachem Biosciences, Inc. (Philadelphia, PA). The terminology used herein for their structure is indicated by the follow- 
ing example for glycine (Gfy): FMOC-Gly-Opfp, where FMOC and Opfp are the amino and carboxy terminal protecting 
groups. For side chain protection, the following protected amino acids are avaflabie from Bachem: FMOOArg (MTr)- 
Opfp having the substituent r^-4-methoKy^3,6^imethylbenzene suHonyi arginine (MTr) at the side chain amino ter- 
minus of arginine; FMOC-His(Bum)-Opfp having the substituent ^-tert-butoxyrrtetriylrtstidine (Bum) at the heterocyclic 
reactive nitrogen in Nstidine; FMOC-Cys(Trt)-Opfp having the substituent S-tnphenyl methyl cysteine at the side chain 
sulfur of cysteine; FMOC-Trp(N-Fbf)-Opfp havtngaformyi group at the amino group of tryptophan's heterocyclic group; 
and FMCK>Lys(N-Bz)-Opt> having a benzyl group on the free amino group of lysine s side chain; where the structure 
in parenthess indicates the protecting group on the reactive side chain. 

[0185] FMOC-Tyr(OTBS)-Opfp having a tertbutyfcfimetriystlyl^ on the side chain hydroxy of tyrosine is 

prepared by reacting an excess of formic acid with FMOC~Tyr(tert-butyf)-Opfp (Bachem) to remove the tert butyl group 
from the protected hydroxy! group to form FMOC-Tyr-Opfp. Thereafter, one equivalent of FMOC-Tyr-Opfp is reacted 
wWi 1.2 equivalents of TBS-CI and 1 .5 equivalents of imidazole tn OCM at room temperature for 1 2 hours under inert 
atnwspheretoformFMO(>Ty^ 

[0186] FWOC-Seit0TBS)-Opfp is simlarty prepared using FMOC-Set(teft-butyl)-Opfp (Bachem) in the reaction. 

FMOOThr(OTBS)-Opfp is also prepared in this manner using FMO<>Thr(tert43utylH)p%) (Bachem). 

[0187] FMOC-Asp(TMSE)-Op4> having a trtnethytsiyt ethyl ester (TMSE) on the side chain carbaxyt group of 

asparfcacti is prepared by first reect^ (Bachem) wim 1.5 equivalents of 2- 

trinriethytsflyletrtandandl.Se 

under inert atmosphere to form FMOC-Asp(OTM$E)-f>tertbutyL There after the TMSE ester is reacted with an excess 
of formic acid at room temperature for 14 hours to hydroiyze the tertbutyl moiety and form a free carboocy! terminus in 
the form of FMOC-Asp(OTMSE)-COOH. The formic acid is evaporated, and the 1 equivalent of the rema ini ng amino 
acid is admixed with 1.1 equivalent of perrtafluu ophenoi (pip; Bachem) and 1.1 equivalent of DCCfor 12 hours at room 
temperature under inert atrnosphere to form the product FMOC-Asp(TMSE)-Oprpi The product is isolated from unre- 
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acted pfp. DCC and precursor amino acid by silica gel chromatography using 10% (v/v) ethyl acetate in hexane. 
[0188] FMOC-Glu(TMSE)-Opfp having a TMSE ester on the side chain carboxyl group of glutamic acid is prepared 
as described above to prepare FMOC-.TMSE- and pfp protected aspartic acid, except that FMOC-Glu-O-tert-butyl 
(Bachem) is used in place of the aspartic acid precursor, to form FMOC-Gu(TMSE)-Opfp. 

5 

2. Preparation of Solid Support 
a. Teflon Support 

10 [0189] A solid support designated NS^S^^Iimethyoxytrityl-^.a'-diacetyi 1 -adenytyO-tef Ion support was obtained 
from Gen Research (Herndon, VA). The solid support is a teflon resin with a modified adenine nucleoside having the 
solid support lintage through the 6-amino group of a purine base, a dimethoxytrrtyi ether (DMT) at the 5* position of the 
ribose ring and acetate esters at the 2' and 3* positions of the ribosering. The sofid support was admixed with 5 volumes 
of 3% (v/v) cfichtoroacetic acid in dichloromethane (3% DCA in DCM) and maintained for 10 minutes at room tempera- 

is ture under inert atmosphere to remove the dimethoxytrrtyi (DMT) protecting group to form a free Stiydroxyl. The result- 
ing deprotected solid support was washed 3 times with DCM to remove the excess unreacted DCA. The deprotected 
washed teflon solid support is ready for coupfing to linker. 

3. Coupling of Linker to Solid Support 

20 

a. Teflon Support 

[0190] One equivalent of deprotected washed solid support (prepared in Example 2) was admixed with 20 equiva- 
lents of a linker designated (l-dimetrwxytrrtytaxy-3-flu^ metrx>xycarbonylaiTinoprcpan-2y1) 

25 diisopropyQ-phosphoramkirte and referred to as 5' Branched-Modrfier C3 (or 5BMC3; available from Glen Research) 
and 20 equivalents of tetrazole (0.45M in acetonitrfle, Gen Research) and was agitated with stirring for 0.5 to 1 hour at 
room temperature to under inert atmosphere. The admixture was then washed with an excess of acetonrbile to remove 
unreacted reagents. The washed solid phase material was retained and admixed with 2 equivalents of Iodine in tetrahy- 
drofuran/water. 9:1 , (available from Glen Research) under inert atmosphere and maintained at room temperature for 1 0 

30 minutes to form oxidized solid-support coupled linker. 

[0191] One equivalent of oxidized solid support-coupled linker was then admixed with 20 equivalents of ace- 
tonitrile/acetic anhydride. 88:12, (capping reagent; Glen Research) for 10 minutes at room temperature to cap any unre- ; 
acted free hydroxyts present on the solid support and form capped teflon-solid support-coupled linker. 

as b. Controlled Fore Qtoss (CPQ) Stroon 

[0192] A controlled pore glass (CPG) support and linker is particularly preferred that can also be used in the 
present invention. The complete structure of an exemplary CPG support is shown in Figure 3. and includes (1) a sar- 
cosine linker moiety connecting the linker to the CPG support (2) a succirty»-arnirK)hexanol (inker that is deavabie by 
40 aqueous ammonia to release the potypeptide^kerKJtigonudeotide conjugate (A-B-C) from the solid support and (3) 
a serine branch-rrxromer having a phototabde cleavage site for releasing the oligonucleotide from the solid support 
The synthesis is carried out stepwise as described below. 

[0193] N-Fmoo-amidotiexan-1-ol (1) was first prepared to form the arranohCKanoi linker moiety. To that end, 6- 
Amino-1 -hexanol (0.75 g. 6.4 mmof) was dissolved in sat aq. UafiQi (10 ml) and cooled on ice. 9-Ruorenytrnethyi 

45 chtoroformate (Fmoc-CI. 1.83 g, 7.1 mmoi) in THF (25 ml, freshly distiUed) was added slowly under vigorous stirring. 
The solution is acfcified with 1 0% (w/v) citric acid, extracted with ethyl acetate and filtered through Cefte 545 before 
separation of tie phases. The organic phase was dried (Na2S0 4 ), tittered, and evaporated to yield the crude product 
(2.15 g). Pve product was obtaind by reaystai^ 
The yield of alcohol 1 was 1.6 g. 

so [0194] N-Fmoc-an*dohexan-1-y1 succinate (2) was prepared to form the sucrinyl-arninohexanol linker moiety. 
The alcohol 1 (758 mg, 2.2 mmoO was dcsolved in arth. pyridine (5 ml) and evaporated to dryness in vacuo. The evap- 
orated oil was redissotved in anh. pyridine (3 ml) and succinic anhydride (229 mg, 2.3 rnrnol). 4-dbr>ethytarrin<x3yridin 
(DMAR 12.5 mg. 0.1 rnrnol), and dttsopropytethytarrine (DIPEA. 370 ni, 278 mg, £2 rrtmol) was added with stirring 
under an inert atmosphere (Ar). Tne reaction mixture was stirred over nigfit, evaporated to dryness* redissorved in 

55 dicrforomethane (100 mJ) and extracted wrth dilute aq. hy dr ochl ori c acid (1 M, 50 mQ. The organic phase was dried 
(Na^^. fitered, and evapcxat 

talized from 2-propanol. The yield of compound 2 was 735 mg (as write crystals), having a melting point (K%x) of 67- 
68*C (uncorr.); and a mass spectroscopy (MS) (El, m/e) 440 (calcd for CssHggNOs + H 440). 
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[0195] 0-(4 ) 4 , -Dimethoxytrityl)-N-Fmoc-L-Serine (3) was prepared to form the serine-branch linker moiety con- 
taining the FMOC and O-DMT termini. Fmoc-Z. -Serine (4.04 g, 12.2 mmol) was dissolved in anh. pyridine (10 ml) and 
evaporated to dryness in vacuo. This process was repeated twice. The evaporated oil was redissolved in anh. pyridine 
(12 ml) and 4 t 4'-dimethoxytrityl chloride (4.20 g, 1 2.4 mmo!) was added with stirring at room temperature under an inert 
atmosphere (Ar). The reaction mixture was stirred over night evaporated to dryness, redissolved in chloroform (4 x 25 
ml) and extracted with sat. aq. NaHC0 3 (50 mi). The aq. phase was back-extracted once with chloroform (25 mi) and 
the combined organic phases were dried from IS^SO* filtered, and evaporated to yield the crude product as a brown- 
ish oil. The oil was dissolved in chlorofonn/ethylacetate and triturated once with hexane and then purified by silica gel 
column chromatography (150 ml silica) using CHCtj/MeOH/DIPEA, 94/5/1 , as the mobile phase. The fractions were 
analyzed by tic and the appropriate fractions were pooled and evaporated to give a white foam designated compound 
3. The yield of compound 3 was 4.29 g, and the 
MS (El, m/e) was 630 (caicd for C^H^NO? + H 630). 

[0196] Controlled pore glass (CPG) was activated for coupling to the above prepared linkers. To that end, CPG 
(Sigma G5019, aminopropyt-CPG, 4.97 g) was suspended in trichloroacetic acid in dichtoromethane (3%, 20 ml) and 
agitated in a shaker for 4 hours. The CPG was isolated by filtration, washed three times with dichloromethane (20 ml), 
three times with DIPEA in chloroform (10%, 20 ml) and three times with diethyl ether to form activated CPG. The acti- 
vated CPG was dried in vacuo. 

[01 97] Afunctional CPG [bf-CPG, CPG-Sar-suc-aho-Ser(0-DMT)(\-Fmoc)] was then prepared comprising ail 
of the element shown in Figure 3. 

i. Activated CPG (2.0 g, loading 83 jimole/g) was placed in a filter funnel (25 ml). Fmoc-Sar (31 3.5 mg, mmol) was 
dissolved in DMF (2.8 ml), mixed with pyBOP (525.9 mg, mmol) in DMF (2.8 ml) and the combined, activated amino 
acid mixture was added to the CPG. DIPEA (1 ml) was added to the reaction mixture and the CPG snaked for 1 
hour at room temperature. Chloroform (5 ml) was added and the shaking continued for another 3 hours. The Sar- 
functionalized CPG was recovered by filtration and washed with DMF (2 x), dchlorornethane (2 x), diethyl ether (2x) 
and dried in vacuo. To obtain maximal loading, this process was repeated to give CPG-Sar with a loading of 77 
^mole/g. Eventual remaining free amino groups were capped with acetic anhydride/DMAP followed by an extensive 
washing and drying in vacuo to form Sar-functionalized CPG. 

ii . The Sar-functionalized C PG (2.0 g) was treated with pipericfine in DMF (2/8, 2x5 minutes) and reacted with com- 
pound 2 (450 mg), pyBOP (575 mg) and DIPEA (1000 ul) in DMF (2 x 2.8 ml). The resulting Sar-suc-aho-Fmoc- 
functionalized CPG was recovered by filtration and washed with DMF (2 x), cfichtoromethane (2 x). diethyl ether (2x) 
and dried in vacuo. 

iii. The Sar-suc-ahthFmoc-functionafized CPG was treated with piperidtne in DMF (2/8, 2x10 minutes) and reacted 
with c»mpound 3 (764 mg), pyBOP (536 mg) and DIPEA (1000^1) in DMF (2 x 2.8 ml) to form a brfonctional CPG 
support or "bf-CPG". The loading was determined to be 61 .7 jimole/g as judged from the Fmoc/feiperidne adduct 
absorption at 302 nm. 

[0198] In an addrbonai embodr/nent, the above bf-CPG is further functionaized by adding a photolabfle cleavage 
site for removal of the polypeptide from the conjugate. 

[01991 To that end, 3-nttro-4-bfOtnoro^ acid is prepared as described by Rich et a!., J. Am. Chem. 

Soc.. 97:1575-1579 (1975) using a-bromoiHoluic acid (4-brornomethubenzoic acid; commercially avalaUe from 
Aidrich Chemical Company, Milwaukee. Wl). The 3^rtro-4^broir o i^e lh yfoenzo^ acid moiety is added to the above bf- 
CPG support by first debtod ong the r+terrrinaJ function by a treatment with pipericfine in DMF (2/8, 2x5 minutes), then 
washed thoroughly and reacted with the 3-nfeo4-bromomet^^ acid that is activated with pyBOP to form a 
photodeavabte bf-CPG hawing the structure shown in Figure 3. The completeness of the coupling reaction is monitored 
by a positive and a negative Kaiser test after the FMOC deprotectfon and the coupling reaction, respectively. 
[0200] Coupfing of an amino acid residue Monomer to the phocteavable bf-CPG (also referred to as o-NB-bf-CPG) 
is carried out bv the esterificaioM procedure described by ffich et aL. supra, and by Barany et al._ J. Ant Chem. Soc. 
107:493*4942(1985) 

[0201 ] The resulting bf-CPG (inker/support molecules have novel and useful features in the context of the present 
invention described hereinbelow. 

(1) The tinker/support molecule has an aqueous ammonia sensitive deav^ 

which is stable to Fmoc p eptide synthesis conditions* and which upon cleavage releases the peptide-oBgonude- 
otide conjugate free from me sold CPG support thereby alowtng for the proo^jctiono^ 

chemical Itorary. A soluble phase chemical Ovary is particularly suited for use in PCR reactions, and for measuring 
peptide utei actions in solution phase bending interactions. 

(2) The L-serine brarxiwrionorner renders the peptxte-otigonucJeotide conjugate tsomericaily pure. 
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(3) The photolabile linker/support molecule (o-NB-bf-CPG) has a light sensitive (hv) cleavage site, indicated by the 
arrow in Figure 3, that is sensitive to light of about 350 nm and which upon cleavage results in the release ol soluble 
polypeptide. Cleavage can be regulated by the time and intensity of irradiation using, for example, a Rayonet RPR 
Reactor as described by Barany et al., supra . 

4. Coupling of Nucleotide to Solid Support 

a. Dgprqtert DMT 

[0202] The capped teflon soOd support-coupled linker prepared in Example 3 was admixed with 3% DCA in DCM 
fa 10 minutes at room temperature under inert atmospheres to remove the DMT protecting group from the linker and 
form a free hydroxy! group. The deprotected linker/support was then washed 3 times with DCM. The deprotected 
linker/support is ready for addition of nucleotide. 

b. Addition of Nucleotide to Teflon Support 

[0203] One equivalent of deprotected linker/support was admixed with about 20 equivalents of a desired blocked 
nucleotide phosphoramidite and 20 equivalents of tetrazole (0.45M in acetonrtriie) to form a coupled nude- 
otide/linker/support (coupled nucleotide complex). The coupled nucleotide complex was then washed with an excess of 
acetonrtriie to remove unreacted reagents. All blocked nucleotide phosphoramidttes were obtained from Gen Research 
and contain a DMT protected blocked 5trydroxyl, a cyanoethyl ester (CNE) and a diisopropylarrene group at the 3'- 
phosphoramid'rte. In addition, the adenine and cytosine derivatives contained a benzoyl group on the base's free nitro- 
gen and the guanosine derivative contains an isobutyl group on the 2-arrino group of the purine base. 
[0204] One equivalent of coupled nucleotide complex was then admixed with 2 equivalents of Iodine in tetrahydro- 
furarvwater, 9:1 , for oxidation as before in Example 3 to oxidize the coupled nucleotide complex. 
[0205] Thereafter, one equivalent of oxidized nucleotide complex was admixed with 20 equivalents of capping rea- 
gent as before in Example 3 to cap any unreacted free hydroxyis and to form capped sofid phase-coupled nucleotide 
complex. 

c. Coupling of Nucleotide to afunctional CPG 

[0206] As a demonstration of coupling chemistry for adcfition of a nucleotide residue to the btfunctiona] CPG sup- 
port we have synthesized and coupled an ofigodeoxyrfconucleotide onto bf-CPG. To that end, oOgonucleotides were 
synthesized on a ABI 394 DNA synthesizer using the standard 1 fimole scale synthesis cycle and commercially availa- 
ble reagents and phosphorarnkfites. The solid support used was the bf-CPG (20 mg, 1 fimoJ) placed in a comrnerciafly 
available empty synthesis column for the ABI synthesizers. Repetitive yield were calculated to 98.9% as judged from 
collecting the detritylation, deluting in p-toluene sulfonic acid rnonohydrate in acetonitril (0.1 M) and measuring the 
absorptions at 498 nm 

[0207] The oligonucleotide was released from the support by cone, aq. ammonia (shaking for more than 24 hours). 
The ofgonucleotide was isolated using commercially available OPOcartridges and analyzed by HPLC and PAGE (radi- 
olabeled with ^P-g-ATP and T4-kinase). 

5. Courting of Amino Acid to Solid Support 

a. Dgprqtert NuylgqWe Complex 

[0208] One equivalent of capped teflon solid phase-coupled nucleotide complex was admixed with 1 equivalent of 
1,adK^3k^ck)^A0]undeG-7-m (DBU; available from AWrich Chemical Co., MOwaukee, Wl) in DCM under inert 
atmosphere for 10 minutes at room temperature to remove (deblock) the fluoromethaxycarbony! (FMOC) protecting 
group from the linker in the nucleotide complex. The deprotected nucleotide complex was then washed with excess 
DCM to remove unreacted DBU and form a deprotected nucleotide complex with a free amino group- 

b. Addition of Amino Acid 

[0209] One equivalent of deprotected nucleotide complex from Example 5A was admixed with 20 equivalents of 
protected amino acid in tf methyfformarrride (DMF) and 20 equivalents of 1-hydraxy-benzotriazole (HOBt) under inert 
atmosphere for 0.5 to 1 hour at room temperature. This reaction condition couples the rarbaxy-terminus of the amino 
acid via its pentaf lucrophenyl ester to the free amino group of the nucleotide complex to form a nudectidetemino acid 
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conjugate (the conjugate). The conjugate was then washed in excess DCM to remove unreacted HOBt and the precur- 
sor amino acid. The protected amino acid is one of those as described in Example 1, having FMOC and Opfp at the 
amino and carboxy terminus, and if needed, a blocking group on the side chain as described before. 

c. Coupling of Amino Add to Bifunctional CPG 

[021 0] As a demonstration of coupling chemistry for addition of an amino acid residue to the bifunctional CPG sup- 
port we have synthesized a peptide. H-HPQFVS-aho, and coupled the peptide to W-CPG. To that end, bf-CPG (428 
mg. 26 jimo!) was placed in a filter funnel hooked up in a agitating device of an device. Reagents used were the com- 
mercially available Fmoc amino acid derivatives either as Opfp esters (for Val and HisJFmocD or the free acid acivated 
wth pyBOP (for Phe, Gn and Pro). Coupling was conducted in the automated peptide synthesizer using conventional 
chemistry. The success of the coupling reaction was rnonrtored by a positive and a negative Kaiser test after the Fmoc 
deprot action and the coupling steps, respectively. 

(021 1] The peptide was deprotected and released from an aliquot of the bf-CPG by treatment with /. piperidine in 
DMF (2^8, 2x5 minutes) ii. cone, aq. ammonia (shaking for 24 hours). The peptide was analyzed by reversed phase 
HPLC as one major peak. MS (Ion Spray, m/e) 813 (calcd for C 39 H 60 N 10 O9 + H 813). 

6. Elongation of the Conjugate 

[021 2] The conjugate can be lengthened by alternating cycles of addition of nucleotides and amino acids. The fol- 
lowing alternating cycles are repeated until the conjugate has desired length amino acid polymer and oligonucleotide 
polymer. 

a. Addition of Nucleotides 

[021 3] To couple an additional nucleotide, the 5XDH on the terminal nucleotide is deprotected with DCA following 
the protocol desenbed previously in Example 4A for the deprotection of the teflon Gnker/support Thereafter, a protected 
nucleotide is added as desenbed in Example 4B. 

b. Addition of Amino Acids 

[021 4] To couple an additional amino acid, the arrtino-FMOC terminus of the last amino acid added to the conjugate 
is deprotected with DBU as described previously in Example 5A. Thereafter, a protected amino acid is added as 
described in Example 5B. 

[0215] The cycle of Steps 1 and 2 above adding alternate nucleotides and amino acids can be repeated until the 
conjugate has polymers of the desired length and structure. 

c. Assembly of PorvDeotkje<)lioor^eotkte CPG Conjugates 

[021 61 As a demonstration of coupling chemistry for preparation of a pepttie^igonudeotide conjugate on a bifunc- 
tional CPG support we have synthesized and coupled an digodeaxyrtomudeotide onto bf-CPG- To that end, oSgonu- 

cleotides (SEQ ID NO : TCT CTC TCT AAA CTC GGG TCT CTC 1 ; and SEQ ID NO AGC TAC TTC CCA 

AGG ATC ACC ACA CTA GCG GGG CCC TAT TCT TW#) were synthesized on a ABI 394 DNA synthesizer using the 
standard 1 jimote scale synthesis cycle and oommerdalry available reagents and phosphoramidites according to stand- 
ard synthetic chemistries. The soid support used was the Fmoc^Fmoc)HTOFVS(DMT>aho-CPG (20 mg, 1 firnoT) pro- 
duced above in Example that was placed tn a corrmercialry available empty synthesis column for the ABI 

synthesizers. Repetitive yield were calculated to respectively 97.9% (24-mer) and 9&3% (45-mer) as judged from col- 
lecting tfTedetritylatkxu (0.1 M) and measuring the absorp- 
tions at 498 nm. 

[0217] TTm 45-mer oigoructectid^ 

a iridcfle 15-mer that is the coding sequerm tar the riptide tising(art^ ATC 
for H, ACC for P. ACA for Q and GCG for V. 

[0218] The peptkie-o6gor»xdectide conjugates formed were realesed from the support by cone, aq. ammonia 
(shaking for more than 24 hours). The peptide-oliopnucieotkte conjugates were isolated using commercially available 

1 24~Mer test sequence 

2 45-Mer sequence. The two flartong 15-mers are PCR primers arxi the rrtiddte 15-mer is the (»*ng sequence fa the pepode 
using (abirarty) the cornrnatess codons CTA tor F f ATC tor H, ACC for P, ACA tor Q and GCG (or V. 
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OPC-cartridges and analyzed by HPLC and PAGE (radiolabeled with 32 P-g-ATP and T4-Wnase). 
d. Synthesis of a Peptide-Oligonucleotide Conjugate Library on a bf-CPG Support 

[0219] Using a btfixictional (bf) CPG support as described in Example 3B, a peptide-oligonucleotide conjugate 
Ibrary is constructed as described below in the following steps: 

Step 1 . The solid support, bf-CPG, is placed in a commercially available empty synthesis column for use in an ABI 
394 DNA synthesizer (Applied Biosystems, Inc., Foster City. CA). The 3 - oligonucleotide primer sequence (SEQ 
ID NO ) is synthesized and then chemically attached to the bf-CPG support according to standard manufac- 
turer's procedures using the standard synthesis cycles and commercially available reagents and phosphora- 
midites. 

Step 2. The support after Step 2 is divided into a number of aliquots representing the number of different monomer 
amino acids to be added at Ibrary position 1 of the polypeptides to be synthesized. Each Fmoc-amino acid residue 
Monomerp ) is coupled according to a protocol consistent with the protection group on the monomer. In a typical 
protocol bf-CPG is placed in a filter funnel hooked up in a agitating device. The reagents used are the commercially 
available Fmoc amino acid derivatives either as O-pfp esters or the free acid acivated with pyBOP. The functional- 
ized bf-CPG is the treated with piperidine in DMF (2/8. 2x5 minutes) and reacted with the monomer, activated as 
described above. The success of the coupling reaction is monitored by a positive and a negative Kaiser test after 
the Fmoc deprotection and coupling steps, respectively. Likewise, the absorption at 302 nm from the deprotected 
Fmocfciperidme conjugate acts as an indicator that the coupling reaction proceeeds succesfuily. The Monomer(1)- 
functionalized bf-CPG is recovered by filtration and washed with DMF (2 x), dichloromethane (2 x), diethyl ether 
(2x) and dried in vacuo to form Monomer(1)-conjugated bf-CPG. 

Step 3. The oligonucleotide sequence that is the unit identifier for the appropriate Monomer(l) is attached to each 
of the Monomer(1)-functionalized bf-CPG's by placing each of the different Monomer(1)-fonctionalized bf-CPG's in 
separate commercially available empty synthesis columns for the DNA synthesizers. The encoding oligonucleotide 
sequence [eg., a comaless trinucleotide unit designated Oligo(1)] that corresponds to the species of Monomer^) 
present on that Monomer(1)-functionalized bf-CPG is chemically attached to the Monomer(1)-functionalized bf- 
CPG by 3 consecutive synthesis cycles on the DNA synthesizer as described previously using the standard syn- 
thesis cycles and commercially available reagents and phosphoramidrtes. to form a Monomer(1). Ofigo(1)-bf-CPG 
conjugate 

Step 4. The different Monomer(1)-functionalized bf-CPG's are pooled, mixed and divided into a nimber of aliquots 
representing the number of different amino acid residue monomers to be added in the Hxary at position 2. Mono- 
mer(2) is attached to the CPG as descrfoed under Step 2 for Monomer(t). 

Step 5. The oligonucleotide sequence that is the unit identifier for the appropriate Monomer(2) is attached to each 
of the MonomenM )-functionalized bf-CPG's by placing each of the different Monomer(1 )-functionalized bf-CPG's in 
separate commercially available empty synthesis columns for the DNA synthesizers. 

These steps. Steps 4*5. are repeated until the appropriate number of monomers and corresponding unit iden- 
tifier oligonucleotides have been incorporated. 

Step 6. The combined solid supports are placed in a commercially available empty synthesis column for the DNA 
synthesizers. The ^-oligonucleotide primer sequence is chemically attached to the Monomer(n) -Monomer(1 )-func- 
tionalized bf-CPG as described previously using the standard synthesis cycles and commercially avafeble rea- 
gents and phosphorami di t e s. The final DMT-group is left attached to the oligonucleotide-peptide conjugates to be 
usedasanaffin^taginthepuriTcationstepi 

Step 7. The combinatorial peptide-oOgonudeotide Ibrary is deprotected and released from the support by treat- 
ment first with TBAF for the appropriate time, and then with cone, aq. ammonia (shaking for more than 24 hours). 
The peptide-oligonucleotide conjugates were isolated and purified using comma c ialy available OPC-cartridges 

7. Removal <yf Protecting Qrqupg 

[0220] After complete synthesis of one or more brfuncticnal molecules, the protecting groups are removed from the 
terminal nucleotide; from the terminal amino acid, and from the side chains of protected amino acids. 

a. Removal pf Nup<9PW? Prgteflrnq Qrpup 

[0221] The DMT protecting group on the 5'-OH of the last nucleotide of the oligonucleotide polymer is removed with 
DCA following the protocol descrfoed previously in Example 4A. 
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b. Removal of Amino Acid Protecting Group 

[0222] The FMOC protecting group on the anruno-terminus of the last amino acid of the amino acid polymer is 
removed with DBU as descrtoed previously in Example 5A. 

5 

c. Removal of Amino Acid Side Chain Protecting 

[0223] Conditions for removal of an amino acid side chain protecting group depends on the particular protecting 
group as follows: 

70 

i. Removal of TBS and TSE ester Groups, 

[0224] One unit of the conjugate is admixed with about 20 equivalents of tetrabutylamrronium fluoride (TBAF) in 
DCM and maintained at room temperature under inert atmosphere overnight to remove the TBS or TMSE ethers pro- 
fs tecting the side chains of tyrosine, aspartic acid, glutamic acid, serine, and threonine. 

ii. Removal of the BzQroup 

[0225] The conjugate is admixed with an excess of aqueous ammonia and maintained at 60°C overnight under 
20 inert atmosphere to remove the benzyl (Bz) group protecting the side chain amino group of lysine. 

iii. Removal of the MTr. Bum and Tpm Groups 

[0226] The conjugate is admixed with 20 to 50 percent TFA and maintained at room temperature for about 5 min- 
25 utes under inert atmosphere to remove the MTr, Bum or Tpm groups protecting the side chains of arginine, histidine or 
cysteine, respectively. Thereafter, the conjugate is neutralized with triethanolamine and CHCI3. 

iv. Removal of the Fbrmvt Group 

30 [0227] The conjugate is admixed with aquec^ buffer at pH 12 and maintained 

utes under inert atmosphere to remove the formyl group protecting the reactive 2-amino group of tryptophan. 

8. Cleavage of Coniuoate fr om Solid Support 

35 a nteavagfl nrf th* Toflng Syppffrt 

[0228] After the protecting groups are removed from the conjugate on the teflon support the brfunctional molecule 
is removed from the solid support by admixing the conjugate with a cleaving solution of 1 00 mM socfium periodate, 100 
mM sodium phosphate buffer, pH 7.2, in acetonitriie/water (1 :4 v/v). The admixture is maintained with agitation at room 

40 temperature with exclusion of light After 4 hours of agitation, the liquid phase removed and the solid support swashed 
with excess water and methanol The wash solutions are then removed and 1 umole of solid support are admixed with 
50 ii n-propytamine, 100 ul acetonitriJe and 400 ut water and main ta in ed at 55°C for 3 hours. Thereafter, the liquid 
phase is recovered, evaporated to dryness in vacua and the dried product is dissolved in acetonrtri le/water. The ab- 
solved product is purified using reverse phase HPLC on an EM IJCnrospher 100RP-1 8m 50um column (4x25) HPLC 

45 coluim.ThernobSephaseAis96%0.1 TEAA buffer (pH 7.0) and 5 % acetonitriie, and mobile phase B is 5 % TEAA 
buffer (pH 7.0) and 95 %acetorii^ 
1 mLper min. The homogeneous fraction is collect 

[0229] The solution is (fiafyzed as needed to change the buffer of the purified material. 

so b. Cleavage of y~ M-q**? Svmt 

[0230] The peptideKXlgpnudeotide conjugate coupled to the bf-CPG support can be cleaved in two Afferent loca- 
tions. 

[0231] As shown in Figure 3, an aqueous anroria reaction cleaves the sarrostne-succinyi Onker moiety, thereby 
55 reieasmg the peptideHjfigonudeo 

[0232] Asalsoshc^inFigiffe3,irratfafm 

the serine biBnch-monomer moiety, thereby releasing the polypeptide from the solid support 

[0233] In both cases, the solid support can readBy be separated from the released soluble, material by ffltration to 
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form isolated soluble conjugate or polypeptide, depending upon the cleavage reaction. 
9. Preparation of a Library off afunctional Molecules 

[0234] Using the synthetic procedures off Examples 1-8, the methods tor producing a bifunctional molecule are 
detailed. To form a library of molecules, additional manipulations are required. First, the synthesis is conducted includ- 
ing the steps of aliquoting, adding cf ffferent units to each aliquot and pooling the aliquots to sequentially build the library. 
Second, if desired, the PCR primer binding sites and the unit identifier oligonucleotides can be added as presynthe- 
sized blocks rather than added nucleotide by nudeotida 

a. Synthesis of Protected Oligonucleotides 

[0235] Using this procedure, PCR primer binding site oligonucleotides P1 and P2 were synthesized having the 
nucleotide sequences shown in Table 2, but having an DMT at the ofigonucleotide's 5' terminus, and having a ONE ester 
and an amino diisopropyl phosphorarridate at the oligonucleotide's 3' terminus. Similarly, unit identifier oligonucleotides 
were synthesized for glycine (GHy) and methionine (Met) having 6 nucleotides per unit and having the blocked termini 
described above. The unit identifier oligonucleotide sequences are shown in Table 2. 



TABLE 2 


Designation 


Oligonucleotide Sequence 


P1 


5'-GGGCCCTATTCTTAG-3' 


P2 


S-AGCTACTTCCCAAGG-S' 




S-CTCAT03 f 




S-ACGGTA-3* 



b. Synthesis of a Library 

[0236] The synthesis of a prototype library is described where the chemical unit is an amino acid, the alphabet size 
is 2, being comprised of glycine and methionine, the unit identifier nucleotide sequence is 6 nucleotides in length, and 
the chemical polymer length is three amino acids in length. A schematic of the process is &hown in Figure 2. 
[0237] The solid support prepared in Example 2 is coupled to the linker as described in Example 3. For conven- 
ience, the solid support-coupled linker molecule is referred to as LINK. Thereafter, protected oligonucleotide P1 is cou- 
pled to LINK as described for a single protected nucleotide in Example 4 to form the structure P1-LINK. 
[0238] In Step 1 . P 1 -LINK is divided into two aBquots The first aliquot is subjected to the sequential coupling of the 
amino acid residue glycine as described in Example 5, and then coupling of the protected oligonucleotide Z 9 ^ as 
described in Example 4 to form the structure CTCATG-P1 -LINK-gly. The second aliquot is similarly coupled to add the 
amino acid methionine and the oligonucleotide Z 17 * 1 to form the structure ACGG1A-P1 -LINK-met The two aliquots are 
then pooled to form a mixture of the two Wunctional molecules. 

[0239] In Step 2, the pool from Step 1 is divided into two aliquots. The first aEquot is subjected to a sequential cou- 
pling as before, adding glycine and the oligonucleotide to form the structures: 

CTCATGCTCATQ-PI-LINK-gly.gly , and 
CTCATGACGGTA-P1 -LINK-metgty . 

The second aliquot is subjected to a sequential coupling as before, adding methionine and the oligonucleotide Z™" to 
form the structures: 

ACQQTACTCATG-P1-LINK-gly.met ( and 
ACGGTAACGGTA-P1 -LINK-metmet . 

The two aliquots are then pooled to form a mixture of the four bifunctional molecules. 

[0240] In Step 3. the pool from Step 2 is divided into two aliquots. The first aiquot is subjected to a sequential cou- 
pling as before, adding glycine and the oSgonucieotide . Thereafter, protected oligonucleotide P2 is coupled to the 
growing bifunctional molecules in the pool as descrfoed for a single pro t ected nucleotide in Example 4 to form the struc- 
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P2CTCATGCTCATGCTCATGP1-LINK-gly.gly.gly . 
P2CTCATGCTCATGACGGTAP1-LINK-met.gly.gly , 
P2CTCATGACGGTACTCATGP1 -LlNK^gly.metgly .and 
P2CTCATGACGGTAACGGTAP1 -U NK-metmetgly . 

The second aliquot is subjected to a sequential coupling as before, adding methionine and the oligonucleotide Z™ 01 . 
Thereafter, protected oligonucleotide P2 is coupled to the growing bifunctionai molecules in the pool as described for a 
single protected nucleotide in Example 4 to form the structures: 

P2ACGGTACTCATGCTCATGP1 -UNK-gty.gfy.met . 
P2ACGGTACTCATGACGGTAP1 -LINK-metgly.met , 
P2ACGGTAACGGTACTCATGP1 -U NK-gly. met met ,and 
P2ACGGTAACGGTAACGGTAP1 -UNK-met.metmet . 

The two aliquots are then pooled to form a mixture of the eight bifunctionai molecules, which is then further divided and 
subjected to further sequential coupling steps to produce a library according to the invention. 
[0241] By increasing the alphabet size one increases the number of aliquots per step. 

[0242] The foregoing specification, including the specific embodiments and examples, is intended to be illustrative 
of the present invention and is not to be taken as limiting. 

Claims 

1 . A bifunctionai molecule according to the following formula: 

(X n )-(.0-X4-X3-X r X r B-Z r Z r Z3-Z 4 .(...)KZ n ) 

wherein: 

X,,, X 1t X 2 , X 3 , and X4 are each single amino acid residues that together comprise a polypeptide, 

Z,,, Z 1t Zg, Z$, and Z4 are each nucleotide sequences of the same length, the length being from 2 to 10 nucle- 
otides, which correspond to and identify X„ f X 1( X 2 , X 3 , and X4 respectively; 

B is a linker molecule operativeiy linked to X t and Z 1( and 

n is a position identifier for both X and Z having a value from 4 to 50. 

2. A bifunctionai molecule according to the following formula: 

(XJK...)-X4-X*-X^ 

wherein: 



Xp, X 1t X 2 , X3, X4, Z„, Z-t, Zg, Z 3 . Z4, B and n are as defined in claim 1, and 

P1 and P2 are nucleotide sequences that provide PCR primer binding sites adapted to amplify the polymer 
identifier oOQonudeotide. 

3. The bifunctionai moleaJe of claim 2 wherein said P1 and P2 each contain a sequence that defines a restriction 
en do n u d e ase site when present in a PCR-ampiffied duplex DNA fragment 

4. The bifunctionai moiecule of claim 3 wherein said restriction sites are located proximal to said polymer identifier oi- 
gonucleotide. 

5. The bifunctionai moleaie of claim 4 wherein said restriction endonudease sites form non-overiapping cohesive 
termini upon restriction endonudease deavaga 



28 



EP 0 643 778 * . 

6. A library comprising a plurality of species of bifunctional molecules according to claim 1 or 2. 

7. The library of claim 6 wherein each of said species of bifunctional molecules in said plurality is present in mdar 
equivalents of from 0.2 to 1 0.0. 

8. A library comprising a plurality of species of bifunctional molecules according to claim 2, wherein P1 and P2 each 
have a nucleotide sequence shared by an bifunctional molecule species in the library. 

9. A method for identifying a polypeptide polymer that participates in a preselected binding interaction with a biologi- 
cally active molecule to form a binding reaction complex, the method comprising the steps of: 

a) providing a library of bifunctional molecules as described in claim 6; then 

b) admixing in solution said library of bifunctional molecules of said step (a) with the biologically active mole- 
cule under binding conditions lor a time period sufficient to form the binding reaction complex; then 

c) isolating the binding reaction complex famed in said step (b); then 

d) amplifying the identifier oligonucleotide of the bifunctional molecule within the binding reaction complex iso- 
lated in said step (c) by means of PCR; and then 

e) sequencing and decoding the identifier oligonucleotide amplified in said step (d) for identifying the polypep- 
tide that participated in the preselected binding interaction. 

10. The method of claim 9 wherein said biologically active molecule is affixed to a solid support 

11. The method of claim 9 wherein said biologically active molecule is operatively linked to a binding means capable 
of binding a binder molecule. 

1 2. The method of claim 1 1 wherein said binding means is selected from the group consisting of biotin, protein A and 
magnetic beads. 

1 3. The method of claim 9 wherein said step (d) comprises the following substeps: 

t) amplifying the identifier oligonucleotide isolated in said step (C) by a polymerase chain reaction (PCR); and 
then 

ii) sequencing and decoding the PCR amplification product of said substep (0 for identifying the polypeptide 
that participated in the preselected bincfing interaction. 

14: A method for synthesizing the lixary of bifunctional molecules in claim 6 wherein the polypeptide of each bifunc- 
tional molecule has a length of at least p amino add residues, where 50 *p* 4. and wherein the amino add rest- 
dues which comprise the polypeptides are drawn from m species, where 50&fTfe4, the method comprising the 
following steps: 

a) dispen si ng equal aliquots of a nascent linker molecule represented by the following formula: 

A'-B-C' 

into each of m reaction vessels wherein: 

A* is adapted for reaction with any of m activated amino acid residues represented by (X^n,*; and 

C is adapted tor reaction with any of m activated unit identifier nucleotide sequences represented by 
(Zt) m ';then 

b) expensing into each of them reaction vessels of said Step (a) a correspond i ng aliquot of them* activated 
amino acid residue represented by (X 1 ) m ' and a corresponding aliquot of the 111 th activated unit identifier nude- 
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otide sequence represented by (Z^* for producing a product nascent Afunctional molecule represented by 
(X^-B-fZ^then 

c) combining all of the nacsent Afunctional molecules from all m reaction vessels produced in said step (b) for 
5 producing an admixture of nascent bifunctional molecules; then 

d) dispensing equal aliquots of the admixture of nascent brfunctional molecules from the immediately prior step 
into each of m reaction vessels; then 

" e) dispensing into each of the m reaction vessels of said Step (d) a corresponding aliquot of the m* 1 activated 

amino acid residue represented by {X^ m * and a corresponding aliquot of the 111 th activated unit identifier nucle- 
otide sequence represented by (Z^ m ' for producing a elongated nascent brfunctional molecule represented by 
PWm^-WiL-^U...)-^^ wherein p^2; then 

is f) combining all of the elongated nacsent brfunctional molecules from all m reaction vessels of step (e) for pro- 

ducing an admixture of elongated nascent brfunctional molecules; and then 

g) repeating said steps d-f until n=p and the synthesis of the library is complete 

20 15. The method of claim 14 wherein said linker molecule is a bifunctional solid support selected from the group con- 
sisting of bf-CPG or o-NB-bf-CPG. 

16. The method of claim 14 wherein said linker molecule is a bifunctional solid support selected from the group con- 
sisting of bf-CPG or o-NB-bf-CPG. 

25 

1 7. An element of a library of oligopeptide/oligonucleotide conjugates comprising: 

a solid support said sofid support being of a type which is dispersible in aqueous solution, 
30 a first linkage unit coupled to said solid support. 

a second linkage unit coupled to said first linkage unit 
a bifunctional unit coupled to said second lintege unit 

35 

an oligopeptide attached to said bifunctional unit and 

an oligonucleotide attached to said bifunctional unit said oligonucleotide encoding said oligopeptide. 

40 18. An element of a library of digopepbde/oSgonudeotide conjugates as described in claim 17 further comprising a 
/ cleavable bond for coupling said first linkage unit to said second linkage unit said cieavable bond being deavable 
by expos ur e to concentrated aqueous ammonia. 

19. An element of a Iforary of d'igopeptide/bfgortucleotide corrugates as described in claim 18 further comprising a 
45 deavable bond for coupling said bifunctional unit to said oligopeptide, said deavable bond being deavable by expo- 

sure to ultraviolet fight 

20. An element of a library of ofigopeptide/ofigonucleotide conjugates comprising: 
so a bifunctional unit 

an oligopeptide attached to said bifunctional unit end 

an o fi gonucteotide attached to said bifunctional unit said ofigonudeotide comprising triplet codons which 
55 encode said ol i gopeptide. 

21. An element of a Iforary of digopeptidefoigortucleotide conjugates as described in claim 20 further comprising a 
deavable bond for coupling said brfun^^ 
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sure to ultraviolet light. 
PatentansprQche 

5 1 . Ein btfunktionales Molekul entsprechend der folgenden Forme*: 
worin: 

10 

X n , X 1( X 2 , X 3 und X4 jeweils einzelne Arrinosdurereste sind, die zusammen ein Polypeptid umfassen, 
Z„, Z 1( Z 2 , Z 3 und Z4 jeweils NuWeotidsequenzen der gletctien L&nge sind t wobei die L&nge 2 — 10 NuWeotide d 
betrdgt, welche zu X,,. X 1f X 2 , X 3 bzw. X4 korrespondieren und sie identifizieren, 
B ein linker-Molekul ist das operativ an X : und Z 1 gebunden ist und 
75 n ein Positionsiderrtif izierungsmittel fur sowohl X wie Z ist mit einem Wert von 4 bis 50. 

2. Ein btfunktionales Moleltill entsprechend der folgenden Forme): 

( XJ^..0-X 4 -X3^ 2 ^ 1 -B^1^ 1 -^ 2 -Z 3 -Z 4 -(...)--(Zn)-P2 

20 

worin: 

X„ t Xt , X 2f X 3 , X* Z n , Z t , Z 2 , Z 3 , Z4, B und n wie in Anspruch 1 def iniert sind und P 1 und P2 NuWeotidsequen- 
zen sind. die PCR-Primerbindungsatellen bereitsteilea die angepasst sind, um das Polymer-Wentifikations- 
25 Oligonukteotid zu amplifizteren. 

3. Das bifunlctionale Molekul nach Anspruch 2, worin P1 und P2 jeweils eine Sequenz enthatten, die einen Restrikti- 
ons-EndonuWease-Stefle definiert wenn diese in einem PCR-amplifizierten Duplex- DNA-Fragment vorhanden ist 

30 4. Das brfunktionale Molekul gemdss Anspruch 3, worin die Restriktions-Stellen proximal zum Polymer-ldentifikati- 
ons-OligonuWeoW lokalistert sind. 

5. Das bifunktionale Molekul aus Anspruch 4, worin die Restriktions-Endonuklease-Stellen nichtuberlappende koha- 
sive Enden nach der Restriktfons-Endorajkiease-Spalturtg bilden. 

35 

6. Eine BWiothek, die eine Vielzahl von Arten brfunktionaler Molekule nach Anspruch 1 Oder 2 umfasst 

7. Die BWiothek gem&ss Anspruch 6, worin jede der Arten brfunktionaler Molekule in der vielzahl in molaren Aquiva- 
I enter von 0,2 bis 10,0 vorhanden ist 

40 

8. Eine Bibiothek, die eine VieizaH von Arten bifunktionaler Molekule nach Anspruch 2 umfasst worin P1 und P2 
jewels eine NuMeotidsequenz besitzen. die von alen brtunktkxialen Mdekularten in der Bibfiothek geteilt wird. 

9. Ein Verfahren zur Identrfkation eines PolypeptkJ-Porymers, das an einer vorausgewdhlten Bindungswechselwir- 
45 kung mit einem biologisch aktiven Molekul teilnimmt, um einen BrtungsreakttonskompIeK zu Mden, wobei das 

Verfahren die folgenden Schritte umfasst; 

a) Bereitstelung einer Bfefiothekvonbrfunktionalen MdekOlea wie in Anspruch 6 beschrieben; dann 

so b) Mischung der BUothek von btfunktiorialen MdekOlen ausdem Schritt (a) in LOsung mrtdem biologisch akti- 

ven Molekul unter Kndungsbedingungen fOr einen Zeitraum, der zur Bildung des BindungsreaWtonstoniple- 
xes ausreicht; dann 

c) Isoiierung des in Schritt (b) gebiideten Bindungsreakttonskoniplexes; dann 

ss 

d) AmplrHkaton des IdenlHDolionsrOGgonuMeotids des brfunktionaien Mdekuls in dem Bindungsreaktions- 
tomplex, der in Schritt (c) mit Wife von PCR isoliert wurde; und dam 
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e) Sequenzierung und Dekodierung des WemHitetions-Oligonukieotids, das in Schritt (d) amplifiziert wurde, 
urn das Polypeptid, das an der vorausgewahlten Bindungswechselwirkung teilnahm, zu identrfizieren. 

1 0. Das Verfahren gemass Anspruch 9, worin das biologisch aktive MolekQI an einen feslen Trager f briert ist 

11. Das Verfahren gemass Anspruch 9, worin das biologisch aktive MolekQI operativ an ein Bindungsmittel gebunden 
wird, welches in der Lage ist, ein BindungsmoleMjl zu bind en. 

1 2. Das Verfahren gemass Anspruch 1 1 , worin das Bindungsmittel aus einer Gruppe, welche aus Biotin, Protein A und 
magnetische Kugetehen bestertt, ausgewahlt wird. 

1 a. Das Verfahren gemass Anspruch 9, worin der Schritt (d) die foigenden Unterschritte umfasst: 

i) Ampfifftation des Identrfikatiors-OligonuWeotids, das in Schritt (c) durch eine Polymerase- Kettenreaktion 
(PCR) isdiert wird; und dartn 

ii) Sequenzierung und Dekodierung des PCR-Anplifikafonsproduktes von dem Unterschrrtt (i), urn das Poly- 
peptid, das an der vorausgewahlten Bindungswechselwirkung teilgenommen hat, zu kierttrf izieren. 

14. Ein Verfahren, urn die Bibliothek von brfunktionalen Molekulen aus Anspruch 6 zu synthetisieren, worin das Poly- 
peptid von jedem brfunktionalen MolekQI eine Lange von mindestens p Aminosdureresten aufweist wobei 50 * p 2 
4 ist und worin die Aminosaurereste, die die Polypeptide umfassen. aus m Arten entnommen werden. wobei 50 S: 
m * 4 ist wobei das Verfahren cfie foigenden Schritte umfasst; 

a) Ausgabe gleicher AGquots eines Start-Linker-Molekuls, das dirch die fofgende Forme) dargesteift wird: 

A' — B— C* 

in jewels eines von m Reaktionsgefdssen. wobei: 

A* fur die Reaktion mrt jedem von m aktivierten Aminosaureresten, die durch (X^m* dargestellt werden. 
angepasst ist; und 

C fur cfie Reaktion mrt jedem von m aktivierten Eiriherts-lderttrfikatiofis-N^ die durch die 

Formel (Z^^ dargesteltt werden, angepasst ist; dam 

b) Ausgabe eines korrespondierenden Aliquots des m-ten aktivierten Aminosaurerestes, der cturch (X^* dar- 
gestellt wird und eines korresponrJerenden Aliquots der m-ten aktivierten Einheits-ldentjfikations-Nukjeotjdse- 
quenz, die durch (Z t ) m ' dargestellt wird. in jedes der m Reakbonsgefasse aus Schritt (a), um ein brfunktionaies 
Starter-Mdekul-Produkt das durch (X^— B-{Zi) m dargesteW wind, herzusteBen; dann 

c) Kornbcnation aller brfunktionaier Starter-Moiekule aus alien m ReaktionsgefaBen, die in Schritt (b) herge- 
steUt wurden, um eine Mischung von biftxiktionalen Starter-Molekulen herzustelfen; dann 

d) Ausgabe gleicher Aliquots der Mischung von bifunktionalen Molekulen aus dem geradevorausgegangenen 
Schritt in jedes der m ReaktionsgetaBe; dann 

e) Ausgabe eines korrespondierenden Afiquots des m-ten aktrvieren Aminosaurerestes, der durch (X^' dar- 
gestellt wird. und eines korresponcfierenden Aliquots der m-ten aktivierten Emherts-kfei itifikalim ts-NuMeotid- 
sequenz. die durch (ZX' dargestellt wird. in jedes der m Reaktionsgef&sse aus Schritt (d). um ein 
verl&ngertes bifunktionales Starter-Mdetoi herzustelea das durch 
(X^-^MX,),,,— B-iZiXn-C-MZJnt dargestellt wtd. worin p;>n*2ist; dann 

f) Kbrnbination after vertangerten brfunktionalen Starter-Molettfe von alen m Reaktionsgefassen aus Schritt 
(e), um eine Mischung vertangerter brfunktionaier Starter-MolekfJe herzustetlen; und dann 

g) Wiederholung der Schritted— f bis n=*> und die Synthese der BUothek vol tstandig ist 

15. Das Verfahren gemass Anspruch 14. worin das Linker-Molekul ein bifunktionaler fester Trager ist der aus der 
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Gruppe. die aus bf— CPG Oder o— NB— bf— CPG besteht, ausgewahlt wird. 

16. Das Verfahren gemass Anspruch 14, worin das Linker-Molekul ein brfunktionaler fester Trdger ist, der aus der 
Gruppe, die aus bf— CPG Oder o — NB— bf— GPG besteht, ausgewahlt wird. 

5 

17. Ein Element aus einer BiWiothek von Oligopeptid-/Oligonukieotid-Konjugaten, umfassend: 

einen festen Trdger, wobei der Trdger von einer solchen Art ist welche in wdssriger LOsung dispergierbar ist 
ekie erste Bindungseinheit, die an dem festen Trdger gekoppert ist 
10 eine zwerte Bindungseinheit cfie an die erste Bindungseinheit gekoppelt ist 

eine brfunktionale Einheit die an cfie zwerte Bindungseinheit gekoppelt ist 
ein OBgopeptid, das an der bifunktionalen Bnhert angeheftet ist, und 

effi Ofigonukteotkt das an der bifunktionalen Einheit angeheftet ist wobei das OfigonuHeotid das Oligopeptid 
todiert 

15 

18. Ein Element einer Btbliothek von O ligopeptid— /Ol igortuWeotkJ—Kbnjugaten , wie in Anspruch 1 7 beschrieben, wel- 
ches deswerteren eine spaltungsfahige Bindung fur die Kopplung der ersten Bindungseinheit an die zwerte Bin- 
dungseinheit umfasst wobei die spaltungsfahige Bindung gespaften werden kann, wenn sie konzerrtriertem 
wassrigen Ammoniak ausgesetzt wird. 

20 

1 9. Ein Element einer Btoliothek von Oligopeptid— /OligonuWeotid—Konjugaten, wie in Anspruch 1 8 beschrieben, wel- 
ches deswerteren eine spaltungsfahige Bindung fur die Kopplung der bifunktionalen Einheit an das Oligopeptid 
umfasst wobei die spaltungsfahige Bindung gespalten werden kann, wenn sie urtraviolettem Licht ausgesetzt wird. 

2S 20. Ein Element einer Bfcfiothek von OHgopeptid— /QigonuMeobd — Konjugaten, umfassend: 

eine brfunktionale Einheit 

ein OBgopeptid. das an der bifunktionalen Bnhert angeheftet ist, und 

ein OfigonuWeotid, das an der bifunktionalen Bnhert angeheftet ist wobei das OligonuWeotid Triplett-Codons - 
30 umfasst welche das Ofigopeptid kodierea 

21 . Ein Element einer BMothek von Oligopeptid— ADIigonuWeotid — Konjugaten, wie in Anspruch 20 beschrieben, wel- 
ches deswerteren eine spaltungsfahige Bindung fur die Kopplung der bifunktionalen Einheit an das Oligopeptid, ^. 
wobei die spaltungsfahige Bindung gespalten werden kann, wenn sie urtraviolettem Licht ausgesetzt wird. 

35 

Revendicatiorts 

1 . Molecule bifonctionneDe selon la tormuie suivante : 
40 (XJ-{...)-X 4 -X3-X 2 -X 1 -B-Zr22-Z3-Z4-(...HZn) 

Oil 

X, . Xg, et X4 sont chacun des rtsidus d'acides amines qui ensemble torment un polypeptide, 
45 Zn, Z 1( Z2, Z 3 , etZ4 sont chacun des sequences de nud6otides de la mame longueur, la longueur atant de 2 

a 10 nucleotides, qui correspondent a uneiderrtrta X,,, X 1( X 2 . X 3 , et X^ respectivement 
B est une moiacute de linker oparativement enchaffn6e a X 1 et Z 1t et 
n est un identificateur de position pour X et Z ayant une valeurde 4&50. 

so 2. Moiacule brfonctkxneie selon la fermie suivante 

(X n )^..0-X 4 -X3-X 2 -X r B^1-Z 1 -Z 2 -Z^-Z 4 -(..0-(Za)-P2 

ou 

55 

Xn, X 1( Xa, X 3 . X4. 2^ Z 1t Z2. Z3, Z4, B et n sont tete que d6finis a la revendkation 1 et 

P1 et P2 sont des sequences de nudaotides qui prodiisent des sites de liaison cf amorce de PCR adaptas a 

ampfif ier loligonudaotide identificateur du polymare. 
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3. Molecule bifonctionnelle de la revendication 2, ou chacun de P1 et P2 contient une sequence qui definit un site 
d'endonucJSase de restriction lors d'une presence dans un fragment <fADN en duplex amplif i6 par PGR. 

4. Molecule bifonctionneile de la revendication 3, ou lesdrts sites de restriction sont places prfcs dudit oligonucleotide 
5 identrf icateur du polymere. 

5. Molecule bifonctionnelle de la revendication 4, ou lesdits sites cfendonuciease de restriction torment des extrgmt- 
t£s ooh6srves ne se recouvrant lors dune scission par l'endonud6ase de restriction. 

10 6. Banque comprenartt un certain nombre d'esp£ces de molecules bftonctiormefles selon la revendication 1 ou 2. 

7. Banque de la revendication 6, ou chacune desdites especes de molecules bHbnctionnelles dans ladite quantity est 
presente k des Equivalents molaires de 0,2 k 10,0. 

is 8. Banque comprenant un certain nombre tfesp&ces de molecules brtonctionnelles selon la revendication 2, ou cha- 
cun de P1 et P2 a une sequence de nucleotides partagee par toutes les especes de molecules bHbnctionnelles 
dans la banque. 

9. Methode pour identifier un polymere potypeptidique qui participe dans une interaction de liaison pr£s6lectionn&e 
20 avec une molecule biologiquement active pour former un complexe de reaction de liaison, la methode comprenant 

les etapes de : 

a) produire une banque de molecules brfonctionnelles telle que decrite k la revendication 6 ; puis 

b) m&anger en solution ladite banque de molecules brfonctionnelles de ladite etape (a) avec la molecule bio- 
25 logiquement active en conditions de liaison pendant une periode de temps suff isante pour former le complexe 

de reaction de Baison ; puis 

c) isoler le complexe de reaction de liaison forme dans r&ape (b) ; puis 

d) amplifier roligonucieotide identif icateur de la molecule bifonctionnelle dans le complexe de reaction de 
liaison isoie dans ladite 6tape(c) par PCR ; et puis 

30 e) mettre en sequence et decoder roligonucieotide identificateur amplrfie dans ladite etape (d) pour identifier 

le polypeptide qui a participe k (Interaction de liaison preseiectionnee. 

1 0. Methode de la revendication 9, ou ladite molecule biologiquement active est f ix6e k un support soiide. 

35 11. Methode de la revendication 9, ou lacfite molecule biologiquement active est activement enchalnee k un moyen de 
liaison capable de Her une molecule de Kant 

12. Methode de la revendication 11, ou ledrt moyen de liaison est seiectionnedans le groupe consistant en biotine, pro- 
teine A et pertes magn&iques. 

40 

13. Methode de la revencfication 9, ou ladite etape (d) compr en d les sous-etapes survarrtes : 

i) amplifier roligonucieotide ide ntificat eur isote k ladrte etape (c) par une reaction en chatne de polymerase 
(PGR) ;etpuis 

45 ii) mettre en sequence et decoder le prodiit d'amplification par PCR de lacfite sous-etape (I) pour identifier le 

polypeptide qui a partiqp6 k (Interaction de liaison presetectiormea 

14. Methode pour la synthase de ta banque de mo*ecutes brfonctkxtneiles de la revendication 6 t ou le polypeptide de 
chaque molecule bifonctionnelle a une longueur d'au moins p r6sidus d'acides amines, ou 502p^4. et ou les r6si- 

so dus d'acides amines qui ferment les polypeptides sont pre*ev6s de m especes, ou 50&nfe4, la methode compre- 
nant les etapes sufvantes : 

a) dtstrbuer des aiquotes egales rf une molecule linker naissante representee par la formula suivante : 

55 A'-B-C' 

dans chacun des mredpcentsreactionnelsou : 
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A' est adapts a une reaction avec chacun des m rSsidus cfacides amines actives representSs par (X^* ; 

Gt 

C est adapts a une reaction avec chacune des m sequences de nucleotides ef identificateur unrtaire acti- 
vSes representees par (2i) m ' ; puis 

5 

b) distribuer dans chacun des m recipients reactionnels de ladrte Stape (a), une aliquote correspondante du 
mi&ne residu d*actde amine active represertte par (X^* et une aliquote correspondante de la midme 
sequence de nucleotides cfidentificateur active representee par (Z^) m ' pour donner une molecule brfonction- 
nelle naissante produite representee par (X^-B^Z^* ; puis 
w c) combiner toutes les molecules bifonctionneiles naissantes de tous les m recipients reactionnete produrts a 

retape (b) pour produire un melange de molecules bifonctionneiles naissantes ; puis 

d) distrfouer des aliquotes Sgales du melange des molecules bifonctionneiles naissantes de retape imm6dia- 
tement anterieure dans chacun des m recipients reactionnels ; puis 

e) distrfouer, dans chacun des m recipients reactionnels de ladrte etape (d), une aliquote correspondante du 
is mi erne r6sidu d'acide amine active represertte par (Xn) m ' et une aliquote correspondante de la mieme 

sequence de nucleotides dldentificateur active representee par (ZJ^ pour produire une molecule bifonction- 
neile naissante allong6e representee par (X n ) m .(...)-(X 1 ) m -B-(Z 1 ) m (...)-(Zn)m» <*» p2>nat2 ; puis 

f) combiner toutes les molecules bifonctionneiles naissantes allongees de m recipients reactionnels de retape 
(e) pour produire un melange de molecules bifonctionneiles naissantes allongees ; et puis 

20 g) repeter lesdrtes Stapes d-f jusqu'a ce que n=p et que la synthase de la banque soit terminSe. 

15. Methode de la revendication 14, ou ladrte molecule linker est un support solide brfonctionnel seiectionne dans le 
groupe consistant en bf-CPG ou o-NB-bf-CPG. 

25 16. Methode de la revendication 14, ou ladite molecule linker est un support solide brfonctionnel seiectionne dans le 
groupe consistant en bf-CPG ou o-NB-bf-CPG. 

17. Element cfune banque de conjugues oligopeptides/oligonucleotides comprenant : 

so un support solide, ledit support solide etant d'un type qui est dispersible dans une solution aqueuse, 

une premiere unite d'enchalnement coupiee audit support solide, 

une second e unite d'enchaTnement coupiee a ladite premiere unite d'enchalnement, 

une unite bifonctionnelle coupiee a ladite seconde unite d'enchalnement 

un oigopeptide attache a lacfite unite brtonctkxinelle, et 
35 un oligonucleotide attache a ladite unite brfonctiorviefle, ledit oligonucleotide codant pour lecfit oligopeptide. 

18. Element d'une banque de conjugues digopeptides/bigonucieotides comme decrrt a la revendication 17, compre- 
nant de plus une liaison scindable pour couplage de ladrte premiere unite d'enchaTnement a ladite seconde unite 
d'enchaTnement, lacfite liaison scindable etant scindable par exposition a de rammoniaque aqueuse concentree. 

40 

19. Element d'une banque de conjugues oligopeptides/bfigonucieotides comme decrrt a la revendication 18, compre- 
nant de plus une liaison scindable pour couplage de ladite unite brfonctionneae audit oligopeptide, ladrte liaison 
scindable etant scindable par exposition a une fumiere uttravioletta 

as 20. Element cfune banque de conjugues digcxpeptides/oltgonudeotides, comprenant 

une unite bifonctionnelle, 

un oigopeptide attache & ladrte unite bifonctionnelle, et 

un oftgonudeotide attache a lacfite unite bifonctionnelle, ledtt oligonucleotide comprenant des codons en triplet 
so qui cedent pour ledit oligopeptide. 

21 . Element d'une banque de conjugues ol i gopeptaesfoigonucieotides comme decrrt a la revendication 20, compre- 
nant de plus trie liaison scindable pour coupler ladite unite bifonctionnelle aucfit oligopeptide, ladrte liaison scinda- 
ble etant scindable par exposition a de la lurriere uttraviotette. 

55 
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PI - LINK 

| Step 1 

CTCATG-Pl-LINK-gly 

ACGGTA-Pl-LINK-met 

| Step 2 

CTCATGCTCATG-Pl-LINK-gly . gly 
CTCATG ACGGTA-P 1 - LI NK-me t . gly 

ACGGTACTCATG-Pl-LINK-gly . met 
ACGGTAACGGTA-Pl-LINK-met . met 

| Step 3 

P2 CTCATGCTCATGCTCATGPl-LINK-gly . gly . gly 
P2 CTCATGCTCATGACGGTAP1 -LINK-met . gly . gly 
P2 CTCATG ACGGTACTCATGP 1 - LINK-gl y . met . gly 
P2 CTCATGACGGTAACGGTAP 1-LINK-met . met . gly 

P2ACGGTACTCATGCTCATGPl-LINK-gly . gly . met 
P2 ACGGTACTCATGACGGTAP 1- LINK-met . gly .met 
P2ACGGTAACGGTACTCATGPl-LINK-gly . met . met 
P2 ACGGTAACGGTAACGGTAPl-LINK-met . met . met 



PI = GGGCCCTATTCTTAG 
P2 = AGCTACTTCCCAAGG 



FIG. 2 
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